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Preface 



This is the last ON TESOL volume of the 70s. Daring this decade the 
TESOL profession has seen profound changes which have affected all facets 
of oar work. These new trends have made us look at the student and the teacher 
in a different Hght, from a different perspective. As is almost always the case 
, & education, these changes have been gradual, and often subtle* Those of m 
who have been involved in TESOL for a number of yean, however, look back 
at the first dams we taught, the first materials we used, die conferences we 
attended, the journals we read, and find that the entire foo\s has changed. 

The 70s constitute an era of many questions and few answers. It is die. 
questions, however, that have made our task fascinating and challenging, These 
questions have a common core: how do people learn second languages? We 
now feel that what we teach and how we teach should revolve around that stfil 
unanswered baric question. Consequently, most of us try to be non-dogmatic 
•bout mattbrs of method and technique because we feel that until the answer 
is found, flearfbiKty snd open-mipdedness are the only sensible alternative* 

Bat teach we must, and while we wait for language learning research to 
provide us with the final solution, we, in the classroom, are also focusing on 
the le arners, We carefully consider their needs and goals, their likes and dis* 
~ttr^ their strengths and weaknesses. As a result, second' language teaching 
today is much more learner-centered than ft was tat years ago. 

, ON TESOL 79 attempts to capture this shift hi focus by concentrating on 
the main issues of this past decade: second language acquisition research, new 
trends in teaching and ovriculum design, current concerns in testing, and 
novel approaches to teacher training. 

Each one of the plenaries in Rut I, presents a perspective of one of the 
major areas of concern. Peter Stevens shows die enormous diversification which 
has resulted from a narrower focus em the learner: ESL, EFL, BSD, ESP, EAP, 
EST, etc Joan Moriey discusses die "new frontier* in die development of 
teaching materials. Merrill Swain gives us an overview of research hi bilingual 
education. Darlene Larson's panel (Roger Shuy, John Oiler, Ernest Bernal, 
snd Paul foster} deals with the important issue of bias in testing. Finally, 
WUga Rivers, Mary Hines, Tom Scovel, and Barry Taylor explore three teach- 
ing methods which have come to our attention in the last cfecade: Hie Stent 
Way, Suggestopoedia, and Counseling Learning/Community Language Learn- 
ing. 

Ftot II, Curriculum Design and Implementation, deals with issues that bear 
directly/ upon teaching. Part III covers most of the testing issues that have 
concerned us this past decade; tots, ckze procedures, and die design of new 

FRir Ml 



tests. Part IV, Teacher Preparatidn, focuses on the teacher as learner and 
shows the i^eed for new approaches to training. Finally, Part V, Research, 
contains papep that explore several areas of interest to teachers, curriculum 
designers, materials developers, and researchers: Communicative competence, 
discourse analysis, and research in the area of writing skills. 

Space limitations have made it impossible for us to include more of the 
many excellent papers presented at the Boston Conference. We trust that this 
selection, however, reflects the concerns of the profession at this time of transi- 
tion and change, and we hope that this volume will contribute to developments 
in the 80*s that will answer some of the questions raised. 

-C.A.Y. 
K.P. 
IS. 
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Differences in Teaching For Different Circumstances 
Or the Teacher as Chameleon 

\ 

Peter Strevens 



The Bill Educational Tryst 

The purpose of this paper is to support a basic hypothesis about effective 
language learning and teaching. The hypothesis is that to be effective in promot- 
ing learning, teaching must take account of a large number of variables. The 
topic will be divided into three sections: first, the way In which our professional 
perceptions of the variables in language te*?hing have changed over the years; 
second, an analysis of the variables that seem to operate; and third, some con- 
sequences of these changing perceptions ind these variaUe5 

1. Our changing perceptions of the variables 

Very few of us s§e more than a small selection of the main variables-just 
those variables, in fact, which operate with obvious effect in our own particular 
circumstances. What is more, even if all teachers agreed in their understanding 
of those variables they con'd observe and identify— which they don V-it is only 
by pooling and comparing their observations and their responses tnat we grad- 
ually establish a full perspective of the complex activity we are ynll engaged in. 

In this regard we have been, in the past twenty years, somewhat in the 
position of those Blind Men of Indostan in the poem by J<flin Godfrey Saxe 
(an American poet of the early 19th century), who encountered an elephant 
for the first time. 1 ' 

» ^ THE BUNp MEN AND THE ELEPHANT 

A Hindoo Fable 

It was six men of Indostan ** The fhird Approached the animal, 

To learning much inclined, And happening to take 

Who went to see the Eleptnnt The squ rming trunk within his hands, 

{ Though aD of them were blind ), Thus boldlv up and spake; 

That each by observation "1 see. quoth he, the Elephant 

" Might satisfy his mind. Is very like a snake! 

The First approached the Elephant, The Fourth reached out an eager hand, 

And happening to fall u And felt about the knee. 

Against his broad and sturdy side What most this wondrous beast is like 

At once began to bawl: Is mighty plata' quoth he; 

"God bless me! but the Elephant u Tis clear enough the Elephant 

Is very like a wall I" Is vervMike a Me! 

The Second, feeling of the tusk, The Fifth, who chanced to touch the ear, 

Cried, M Ho! what have we here Said: E en the blindest man 

So very round and smooth and sharp? Can tell what this resembles most; 

To me *tis mighty clear Deny the fact who can, 

This wonder of an Elephan This marvel of an Elephant 

* q- Me a spearr 1* very like a fan! 

ERIC n ' 



The Teacher as Chameleon 3 

We perhaps are in a much happier position than Saxe's wise men ^because 
our 'elephant' has been reported, observed, photographed and subjected* to 
minute analysis and discussion by annual Conventions for the past dozen years. 
As a result, we share a more comprehensive and accurate understanding of 
what an elephant is like than Saxe's blind Indostanis could muster— though at 
the same time we are increasingly aware that there is not just one elephant, 
but whole families ami populations and even sub-species of elephants, and that 
therefore the scope of our study gets ever wider, even as our comprehension 
of it grows. 

You will recall that one reason for the complementary analyses given by 
Saxe s blind men was that they each observed the elephant from a different 
standpoint. It is interesting to notice that a whole sequence of different view- 
points about the nature of our own profession, TESOL, is embodied in the 
sequence of acronyms or initials that have been employed on both sides of the 
Atlantic. Hie proto-TEFL term, as it were, was probably ELT (English lan- 
guage teaching), the original undifferentiated term used in Britain as early as ^ 
1945 and still employed today, for example in the title of the British ELT 
Journal, founded in 1948. Before long it became necessary, especially in British 
ELT, to distinguish between ELT in circumstances where the language had a 
special historical status in the community, fur instance, English in Nigeria, or 
Hong Kong, or Eiji — (where it is referred to as a second language, hence ESL), 
and on the other hand ELT in circumstances where the language has no special 
standing and is not in widespread use — (where it is a foreign language), as in 
Japan, or Brazil: hence EFL. In British usage, when referring to the teaching 
of English to foreign students visiting Britain, the term TEFL became universal 
because such students almost always come from and return to 'foreign language* 
countries. To sum up the distinction, for us in Britain ESL indicates sociolin- 
guistic conditions m a foreign country which compel significant changes in 
teaching English there. 

In the United States, though, I sense a great deal of free variation, un- 
certainty, even change of meaning, in the use of the terms TEFL and TESL. 
For a given activity, sometimes the one is used and sometimes the other. There 
are perhaps two reasons for this: first, the American profession has been rela- 
tively little engaged in ESL in the British sense, i.e., in former British countries, 
so that this EFL/ESL distinction has had little relevance; second, the issue has 
been clouded by multiple meanings for the word 'second*. By talking of a 



The Sixth no sooner had begun 
About the beast to grope, 

Than, seizing on the swinging tail 
That fell within his scope, 

M I see," quoth he, "the Elephant 
Is very like a rope!" 

And so these men of Indostan 
Disputed loud and long, 

Each in his own opinion 
Exceeding stiff and strong, 

er|c 



Though each was partly in the right, 
Ana all were in the wrong! 

Moral 

So oft in theologic wars, 

The disputants, I ween, 
Rail on in utter ignorance 

Of what each other mean, 
And prate about an Elephant 

Not one of them has seen! 

John Godfrey Saxe, 1816-1887 
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'second language' — meaning the next one learned after the mother tongue — 
and by talking of a 'second foreign language* (or a third, or fourth) — for exam- 
ple, Spanish, French, or German in the school system— we have! encouraged 
semantic ambiguity. By i^ontrast, the name of our organisation, TESOL, is clear 
and unambiguous in the way it unites all branches of the profession. 

What terms came next? There is some uncertainty about the chronology. 
My own understanding is that the next development occurred when the special 
and contentious problem arose of teaching English to children of the black 
community in the United States: was it to be regarded as 'mother- tongue' teach- 
ing? Should it be handled as a branch of TEFL? Black English became ac- 
cepted as a dialect of English to be reckoned with in educational terms, and 
in this way ESOD or TESOD were bom— and even ESOLD and TESOLD: 
teaching English to speakers of other languages and dialects. 

Yet that is not by any means the end of these subdivisions within our own 
profession. The term ESP (English for specific purposes) has come into uni- 
versal use to designate the teaching of English with particular restrictions on 
its aims, content or skill objectives. And within ZSP, EST refers to English for „ 
science and technology, a particular sub-set of ESP which entails special learn- 
ing features (and special teaching requirements, too) since the scientist or 
engineer has to learn 'the Janguage of science' — whatever that is. The British 
Council favours a further internal distinction within ESP: EAP (English for 
academic purposes) and EOP (for occupational purposes). 

There have been other TEFL-terms: ElAL (English as an international 
auxiliary language) had a brief existence in Hawaii before being dropped in 
favour of an INTER/INTRA distinction (English for international puiposes, as 
in Japan, Brazil, etc. and English for intranational purposes, as for large popu- 
lations of people in India, Malaysia and Singapore, and increasingly in other 
historically English-using countries ) . 2 

Two other terms now in constant use have not been turned into acronyms 
or initials. In the United States, 'Bilingual Education is a specialised and 
established branch of English teaching; while in Britain we refer to Immigrant 
Teaching, which is that special branch of teaching English to the children of 
immigrants within the framework of the ordinary State school system — in fact, 
as a sort of TEFL-flavoured mother-tongue programe. 

The lesson to be learned from this sequence of TEFL terms is that in the 
short space of twenty-five years our perceptions of some major variables within 
our own- profession have repeatedly grown sharper and more delicately^ differ- 
entiated so that our existing wky^ of referring to what we do iave become 
inadequate and have required the acceptance of a dozen or'trreR specialised 
terms. And there will undoubtedly be more distinctions made in the future. 

Yet the terms we have discussed above are only macro-variables. Beyond 
these macro-variables are a whole rarv?e of other choices, distinctions, alterna- 

1 See particularly Smith (1978) and Strevens (forthcoming). 
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The Teacher as Chameleon 5 

* Mves, divergences, and these are variables whose operation affects our daily 
professional activity. 

2* An analysis of the variables 3 

When we attempt a reasonably comprehensive survey of the many variables 
which in total affect our profession, some teachers will already be aware of 
all the factors I shall mention, and more; to them, my apologies. Others, espe- 
cially teachers who l|ave always worked in one particular branch of TEFL, 
will be like Saxe's Blind Men of Indostan because they will have seen some 
of the variables in action while being unaware, perhaps, of others; teachers 
with different experience will recognise a different selection, depending on 
which part ef the TEFL elephant's anatomy they have been concerned with, 

TEFL teachers who. rrtove from one kind of teaching to anothejr,-«~-from 
teaching 'general* TEFL tu adults in the United States or Britain, to teaching 
children in Colombia, to teaching intensive ESP for medical staff in a country 
* on the Arabian Culf, to university TEFL in Scandinavia, .to teacher training in 
Singapore-^-quickly discover that there arfc indeed a great many variables at 
work. Let us look af five types: 

1. Some of these variables appear as restrictions upon the teachers are actually 
permitted to do: for example, in cultural matters. In some societies, for 
inAnce, the kind of boy-girl relationships commonly found in TEFL text- 
books, illustrations of girls in short dresses, reference to alcoholic drinks, 
mention of dogs as domestic pets, even relaxed delations between teachers 
and students if they are of different sexes all may be culturally unacceptable. 

2. Other variables will appear as limits on what is physically and organisation- 
ally possible: very large .classes; classroom furniture bolted to the floor 
(which severely restricts communicative activity and work in groups); in- 
adequate numbers of classes per week; a school year that is nominally of 
30 teaching weeks but in practice may be of 20 weeks or less; lack of central 
authority or such draconian central authority that any deviation from normal 
practice, even creative experiment by intelligent teachers, becomes almost a 
criminal offence; absence of suitable textbooks, tef riling aids or equipment, 
even of blackboards and exercise books. 

3. Some variables will affect the teacher through standards of teacher training 
where the national average level of professional competence has pervasive 
consequences on what can be achieved in the classroom. 

4. Yet other variables relate to sooioUnguistic attitudes and expectations. If 
English is an unpopular language in a given place (e.g. in rural Quebec at 
present), teaching it may be ineffective for the learner and unpleasant for 
Ibe teacher; if a community normally expects that its citizens will learn 

8 The identification of variables has a jong history. The analysis in this paper is concerned 
more with emphasising the scope and diversity of types of variables rather than with com- 
piehensivity in detail. It builds on the >fork of very many specialists. But one work abo^e 
should be singled out for its subtlety and wide coverage of variables jn the learning and teach- 
ins of languages, i.e. Maekey (1965). ; 
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- English j(c.jj. in Holland, or Scandinavia) they generally do so, but con- 
versely, low expectations (as of learning foreign languages by English school 
children) are fulfilled by "generally poor standards pf achievement. 
5. The Educational framework of TEtfL, too, may vary. In some places the 
teaching of English begins at age 7 or 8 and continues as a part of normal, 
liberal arts education, for ,up to 10 or 11 years; elsewhere it starts only at 
age 14 and lasts for 4 years. By contrast, the -framework for ESP is almost 
always tftat of adult education. It is often independent of the rigid admin- 
% istrative matrix of general education and is usuajly open for methodological 
originality— i e. ESP is not tied to any particular method or r aterials. As 
a result the^successes and failures in ESP are more directly : used by the 
teachers, as distinct from the system, than is the case in a general educa-/ 
tional framework where the system often casts the variables into a mould 
which makes, success difficult to achieve, save by exceptional teachers 

Five sets of variables have been mentioned so far: restrictions on teach- 
ing, physical and organisational constraints, standards of teacher training, so- 
ciolinguistic attitudes, the educational framework. They can all. be roughly 
classified as derived from the community, that is to say, from the mixture of 
sociolinguistic conditions and the current realities of educational administration. 
Teachers in Britain or America or Australia are of course equally subject to 
constraints of this type', but in those countries teachers tend to be unaware of 
them and* only realize their existence on first working abroad, or occasionally 
at home wh«*-*ome really major reform is being fought through our various 
of 'legislative and administrative machinery. 
From variables ascribable to the community and generally acting as con- 
straints on the teacher, let us turn^to other variables more directly under the 
•'control of teachers. It is necessary to do thjs in order to remind ourselves that 
although teachers work to a large^sxtent within a matrix of conditions that are 
uot of their' choosing .and ar only u 14 imally under* their control, they do have 
the responsibility for selecting* among a wide range of further variables. 

The most iibvious sets of teacher-cc 'rolled variables fall under the twin 
headings of syllabus and methodology , i.e the large and growing range of 
instructional techniques available* to the teacher.^ methodology) and the prin- 
ciples for selecting and organising-thc^ content to 6e taught- ( syllabus design). 

The two are inter-linked. Modern ^ews^bout the importance- of the learner 
lead us 'to analyse the learners needs in ever-greatei* d .ail, to establish \yith 
maximum certainty his precise aims and objectives; io specify the/ language 
and other content he will require; to organise- the sequence in whiehjt is most 
appropriate to teach this content to this learner; to consider what teaching 
techniques can most f .icctiyelyf be used by the teachers available; and to teach 
this content in this sequence to these learners with these needs: 

The first two parts of that ptocess, analysing the learners needs and 
objectives and determining appropriate content and sequence, fom part of 
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syllabus design. This is a process where members of the teaching profession 
Consider several dimensions in order to establish the best possible 'fit' or match- 
ing. What are these dimensions? Here are some of them: the identity of the 
lear rs and their principal characteristics (age, educational level, stage of 
proficiency already reached, etc.); their aims and objectives (to pass a par- 
ticular examination, to achieve practical oral communication Math certain types 
of people, to read with understanding certain kinds of written text, etc.); the 
language appropriate to those objectives (which may either be general English 1 
or may be some defined sub-set of English, like 'English for Indian village 
tropical health projects', or 'English for air traffic control', etc.) any special 
need for some items to be ordered before other items (e.g. in an ESP course 
for ships' officers, to teach compass directions and bearings early on, where in 
contrast they might be taught late or never in a course for specialists in tropical 
medicine). ** 

The syllabus designer will bear in mind several different aspects of the 
content: linguistic, situational, notional, functional and communicative. Lin- 
guistic aspects: the language content to be learned in terms of grammar, yoca- 
* bulary, pronunciation, formulae and fixed expressions, idioms, and so forth; 
situational aspects: how *o make the language content more interesting, more 
easily learned and remembered, and more*iike real life by teaching it in relation 
, to familiar or imaginative situations, like* at the Post Office, telling the time, 
i going t< school, and so forth; notional aspects: ways in which various funda- 
mental notions about the, universe are expressed in English notions of time, 
of place, even of case, (i.e. who does what, who to, aud with what result);* 
functional aspects: how English expresses certain funclions of language, such 
as negation, possibility anJ impossibility, description, questioning, judgements, 
and many other*; and communicative. aspects: how ta use English for meaning- 
ful, deliberate, effective communication between human begins, including dis- 
coursal and rhetorical rules. 

Methodology, too, is a variable whose manipulation is normally open to 
the teacher, not imposed upon him or her by outside authority/ The term in- 
cludes both strategic and tactical decisions. Strategic decisions include such 
questions, as: the balance of importance and time to be given to the various 
skills of speaking, reading etc.; the extent to which overt use is made of gram- 
mar, and what kind of grammar it will be; the extent of individualised pro- 
cedures, if any; the relation betw* i the teaching* syllabus and any terminal 
a cessment or examination; the ( " reliance upon class work and private 

study; the use of language lab» ^ ^er aids; ^nd many others. Tactical de- 
cisions include: the selection of particular teaching techniques at particular 
points in each course and ea Ji lesson; the flexibility of teachers to change their 
techniques from moment to moment according to the learning paths of the stu- 
dents; the encouragement of methodological originality and creativity on the 
. part of the teacher (and even the learner, on occasion); and many more choices 
of this limited kind. 
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Compared with the central importance ot syllabus design and methodology, 
the variables of materials production and evaluation are perhaps secondary, 
even though their impact 8n the daily classroom life of the teacher and the 
learner is often considerable. There i$, however, one further variable on the 
teaching side (as distinct from the community side) which must not be over- 
looked: teacher training. To the* individual teacher, teacher training is usually 
looked at in terms of preparation for his or her personal career. The individual 
learner rarely even considers the question, except perhaps to reassure himself 
that he is being helped to learn -by someone who is at last not an amateur — 
much as the patient awaiting surgical treatment likes to be certain that he is 
not in the hands of a do-it-yourself hobbyist. Yet on* a larger scale the extent, 
nature and quality of the teacher training available in a given country will 
crucially affect the quality of teaching that is normally given there; and it will 
also largely determine which choices are made uncjgr the variables of syllabus 
design and methodology. So when we observe different kinds of teaching and 
different standards of teaching taking place under different conditions we 'can 
be sure that the kind of teacher training undergone by the teacher will be a 
major determiner .thereof ; and we can also be certain that if we are seeking 
ways of modifying the teaching and learning in a given place,, we -shall be 
bound to include the teacher training system in our calculations. 

The variable of teacher training embraces many elements. If one accepts 
that teachers are by definition members of the educated sector of the commu- 
nity (and that the notion of fin uneducated teacher is, or ought to be, a logi- 
cal Contradiction) then a prime element in teacher trainirg concerns the level 
of personal education possessed by the teacher trainee. A case can be made, 
1 for the view that teaching, being a matter of close and continuous interaction 
between individual human personalities, also requires a high level of emotional 
maturity and stability. Then one must consider the balance between the three 
main components of a teacher training course: a skill* component which de- 
velops practical, instructional techniques, both those common to all branches 
of teaching and those that, are special to TEFL, including an adequate com- 
mand of the language he or she is teaching; an information component, in 
which the teacher takes in the very considerable body of knowledge about 
education, teaching, language, English today, sociology, psychology, the or- 
ganisational framework s'he is working in, and much more; and in addition a 
theory component, which provides him/her with an intellectual basis for know- 
ing not just what to teach and how to teach it, but also why to teach that rather 
than something else. 

Let me offer just one example of the kind of information that ought to be 
included in TEFL training courses. It concerns the astonishing changes that 
are taking place in the spread and functions of English on a global scale. Young 
teachers need to Le informed about the way that English is now widely re- 
garded, abroad, as an international possession; it is no longer the cultural prop- 
erty of the British, the Americans, the Australians and New Zealanders. Teach- 
9 
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ers need to realize that in a growing number of countries ° India, Singapore, 
parts of Africa, English is used by vast and growing po| 'ions of people (26 
millions in India alone) who never meet us native speak*" .> and who have no 
desire or need to model their English on ours. We have to get rid of our mono- 
lingual ethnpcentricity and accept the existence of a great many localised forms 
of English, one characteristic of which is precisely 1 hat they are different from 
British and Australian and American English. There are many other kinds of 
information to be included: this may serve simply as an example. And yet, as 
we shall see later, even an ideal teacher training programme supplied to every 
TEFL teacher at the outset of his or her career fails to meet all the needs of 
the profession. 

We must npw look briefly at one other set of variables t|iat affect our teach- 
ing : variables relating to the learner. To list them all would be a major task. 
We nd>y recognise that TEfc'L learners are human beings (as distinct from 
laboratory animals) and that w\:!** they share certain universal human char- 
acteristics they also each carry a wide range of strictly individual features. How 
can we distinguish those that are significant from among the mass of learner 
characteristics? 

TTiere are three learner variables above all, it seems to me, which lie at 
the heart of the interaction betweA learner and teacher and determine how 
effectively the learner learns. The^^are: the learners reasons for learning, his 
or her attitudes towards learning, a id his or her expectations of learning. 4 

Learners reasons for TEFL le. A in'ng vary very greatly. For young children, 
it is normal and enjoyable to do what teacher says; for adolescents reasons for 
TEFL learning are often founded on h*?zy notions of 'relevance* and on emo- 
tions of love or hate towards the teacher, and they are frequently negative 
reasons, so that learning does not occur; for adults, the reasons are generally 
quite definite and instrumental. 

Learners' attitudes towards TEFL learning are a compound of their at- 
titudes towards learning in general, towards English in particular, towards the 
teacher and the textbook, and even towards "themselves as learners. 

Learners expectations of TEFL learning are based partly on their past his- 
tory and experience of language learning, partly on opinions fed to them by 
their family, their friends and their teacher, and partly on the current folk-myth 
about whether his/her community normally expects people to learn English. 

As far a$ teachers are concerned, they quickly discover two things: first, 
the success of their own efforts in the management of learning and of their 
learners* efforts in achieving their optimum learning rate (Streve^s 1977b), 
depends heavily 011 these reason* attitudes and aipectations. Second, much of 
their time and thought is devoted to unobtrusive efforts to make these factors 
as positive as can be achieved. 

4 1 am deliberately avoiding the term motivation, partly because it has different meanings 
for different people, but chiefly because motivation unbraces a large number of factors 'Hie 
3-term analysis into reasons, attitudes and exp ectations is made from the standpoint of the 
teacher rather than on a basis of psychological >r sociological analyses. 

o 
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To sum up this penultimate section, then, ^ithin the totality of TEFL there 
exists a la%e number of variables)^ identifiable broadly as relating to the com- 
munity, the teaching profession itself, and the learner. In addition, the nature 
of the profession's perceptions about its own task changes and develops gradu- 
ally with the passing of time. 

In short, the teacher is like a chameleon. To be efficient, i.e. to survive, the 
chameleon must continually observe his surroundings and adapt to them. And 
adapting means action. It means selecting from among a range of possible 
colourings ^nd patterns precisely those which are appropriate to the moment. 
. The teacher, to be efficient, perhaps even to survive, needs to adapt in a similar 
way: to select from among a wide range of possible techniques and courses of 
action precisely those which are appropriate to the circumstances of the learn- 
ing/teaching situation. In the third and final section of my paper I shall com- 
ment on the responsibilities of the teacher and of the profession. 

Teaching is a hard and emotionally bruising occupation, and language 
teaching more so than most other specialisms, especially if the language one 
is teaching is not one s mother* tongae. Teachers essentially need two distinct 
kinds of follow-up to be available to them after and beyond their initial train- 
ing. One kind of follow-up must be the career-long provision of morale-boosters 
—of events where teachers can meet together, share ideas, renew acquaintance 
- with success, seek professional commiseration with our Httlj deprivations, and 
generally break free of the frustrations and annoyances of the daily round (the 
TESOL Convention is the outstanding example of this kind of event); also 
teachers* associations, workshops, conferences, magazines and journals, etc., and, 
for teachers for whom English is not the mother tongue, opportunities to restore 
and improve their command of the language* 

The secdnd kind of follow-up is of a different kind and affects only a pro- 
portion of teachers, though again it is "essential to the onward development of 
-the profession as a whole. This is the availability of courses of farther training, 
with the purpose of developing an adequate higher echelon of senior profes- 
sionals, teacher trainers, advisers, and consultants. 

3. Consequences of this analysis 

The argument so* far jJresented is basically a simple one. It is that the pro- 
fession we belong to is now a fpeat and global one, of vast scope and influence 
and containing enormous complexities and subtleties. No single one of us, as 
a teacher, encounters all the possibilities of choice. Indeed, most of us live our 
working lives within a single, fairly simple segment of the whole; we^reusu- 
ally awarl of only the big toe of the TEFL elephant, or the tip of its tusk, or 
its tail, or its trunk. Br.t just as the elephant exists as a complete and complex 
animal even if we can see only a part of it, so also TEFL exists, ^and is subject 
tcT all the many variables we have been discussing. * 

But we are teachers. We are members of the to'il profession of humaif 
education. And that places before us the responsibility for knowing what we 
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are doing, for being aware not just of our own little segment of the profession, 
but (at least in broad outline) of thje whole field, of the variables that now 
exist and of their consequences. What are these consequences? 

The first and most obvious consequence is that any lingering nostalgia for 
a single 'best* method is completely mis-placed. No single method can con- 
ceivably be equally suitable for all values of all the variables, for all learners, 
of any age, regardless of aims, of attitudes, of level of proficiency, and so forth. 8 

A second consequence is that the individual teacher needs to acquire trjp 
widest and deepest understanding of all the variables he or she is likely to 
encounter in the language learning/teaching situation where s/he will be work- 
ing. The teacher needs to select, devise and operate, for any given situation, 
that methodology which has the best fit, the closest match, with all the variables* * 

Behind the teacher is a third and final consequence of the variables facing 
us. This is a consequence for the TEFL profession. Now that TEFL is so vast, 
so complex, yet so important, it ha& become essential for the profession to have 
corporate ways of observing, monitoring, analysing and influencing its own 
development Hence the vital importance not only of academic centres of ex- 
cellence such as university departments and teacher training colleges, not just 
of resource centres like the Centre for Applied Linguistics, the Regional Eng- 
lish Language Centre in Singapore, the British Council's English Teaching In- 
formation Centre in London, and similar centres elsewhere, but also of ^profes- 
sional 'teachers organisations— of TESOL, IATEFL, AILA and the rest— and 
of their programmes of activities. 

This, (ben, is the link between the relatively drab practical daily activity 
of our working lives, and the brightly-coloured programme of a TESOL Con- 
vention. Her3 is where we learn from each other about variables other than 
tile ones we know of and meet in our ordinary life; here is where we exchange 
experiences, and wher- we discover that the wall, the spear, the snake, the 
tree, the fan and the rope, are in fact part of an elephant. Our profession be- 
comes daily even more complex, subtle, sophisticated. We as teachers musf learn 
to adapt, ,chameleon-Iike, to an ever-greater array of variables, so that w£ can 
offer to our students not a single technique which may or may not be effective, 
but the best possible choice of teaching for the particular variables that operate 
in our own students' individual circumstances. 



5 J I have any reservation of substance about the claims made for the Silent Way, Corr- 
muMty Language Learning and Suggestopedia, it is that they seem to claim to be suitable a 
all ponces. 1 find that to be counter-intuitive, and against experience. 
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1. Introduction 

Materials development in ESL has many dimensions. 1 Reflect for a moment 
on the following concerns expressed by respected colleagues. Each addresses 
a different aspect of the art/science of developing instructional materials for 
second language learners*- In later sections cf the paper special reference will 
be made to each of these comments. 

The higher the general standard of the teachers, the less important are the course- 
books and other teaching materials. But when the standard of teachers is low, as it 
inevitably is for a time in conditions of rapid expansion of education, then the stand- 
ard of teaching materials assumes great importance. This is what is meant by 'the 
teachers/materials equation/ I nsu fficien tly-traWd teachers working with poor ma- 
terials face a considerable handicap. When this occurs it is a defect of the system, 
for whose remedy a three-way collaboration is necessary, between publishers, ad- 
ministrators, and teachers. ( Strevens, 1977a ) 

If authors of textbooks of the future are to respond to all of the significant changes 
taking place Janguage pedagogy, they cannot overlook the social climate in which 
learning tataf mce. In humanistic classrooms — and there are more a'nd more of 
them — people rake responsibility for their own learning. (Dubin, 1978) 
Efficient language teaching must work with, rather than against, natural processes, 
facilitate and expedite rather than impede learning. Teachers and teaching materials 
must adapt to the learner rather than vice versa. (Corder, 1976) 
Every teacher is an adapter of the text used. The more experienced and sensitive the 
teacher, normally the greater the adaptation. We're adapting even when we refer 
\ to an exercise covered earlier or when we complement the lesson with realia, music, 
\r unprescribed student activities. (Madsen and Bgwen, 1978) y 
. N. adaptation Is invitable; it ought therefore fir receive more attention and more 
prestrge than it usually does. . . . language study is inevitably a total human ex- 
perience; writers and teacjiers ought therefore to act as though it is, ( Stevick, 1971 ) 

% tlgnificarit developments affecting ESL 

Twelve years ago TESOL Was established as an independent organization 
dedicated to the professional study of teaching English to speakers of other 
languages and dialects. Since trjat time many significant developments have 
taken place in the field of second language learning and teaching in related 
disciplines in world affairs. 

* The phrase "materials development" is to be interpreted with the broadest of definitions 
throughout the paper unless otherwise specified. 
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In the following section three specific developments will be discussed. Each 
has had a particularly strong impact on the field of ESL. Their direct influence 
on materials development in ESL will be examined. 

1. The ESL "population explosion". Within the current world population of 
around four and a quarter billion people it is reported that around 374 million 
speak English as their primary language. Added to this, and of particular con- 
cern to TESOL, an estimated 305 million persons use English as their secondary 
language. This number continues to grow yearly, as does the growth-rate figure 
The result, world-wide, is a burgeoning "population explosion" of ESL students 
— students who need eFective instructional programs carried out by well- 
trained ESL teachers. m 

Decker (1978) has compiled the following information on English-language 
use: (1) English is the most commonly taught foreign language in the schools 
of countries where it is not the mother tongue of the inhabitants, including the 
U.S.S.R. and the People's Republic of China, (2), in the most significant tech- 
nical and critical fields — science, education, commerce and trade, technology, 
medicine, engineering, international politics, and journalism — there are more 
publications in English than in any other language, (3) more books, newspapers 
and magazines, radio and television programs, records and tapes, and films are 
produced in English— and distributed more widely — than in any other single 
language, (4) English is the most commonly used language at international 
conferences, (5) English is the official language employed in international air 
traffic and in international shipping, (6) three-quarters of the world's mail is 
written in English. 

While estimates of the numbers of persons who use English as their sec- 
ondary language are well over 300 million, often documented information on 
actual numbers of students who male* ap individual ESL populations is dif- 
ficult to obtain. In the United States, m fact, the National Center for Educa- 
tion Survey ( NCES ) only recently completed studies whiclj provided data on 
the* numbers of learners who make up various segments of the ESL population, 
on the numbers of teachers involved, and on the nature of teacher qualifica- 
tions. 

The NCES survey supplied significant and sobering information in three 
areas which are of particular concern to ESL professionals: (1) a totaling and 
subtotaling of the enormous numbers of present potential ESL learners, (2) 
an accounting of the alarming deficit in the number oi minimally trained (let 
alone weli-trained) ESL 'eachers available, and (3) a specification of gross 
inadequacies in the teacher preparation of teachers who are pressed into service 
as ESL inst/uctors Dorothy Waggoner, (1978) Education T ugruni Specialist for 
NCES V reported the following survey information. 

An estimated 28 million persons in the United States (one in eight) have non- 
Enghjh language backgrounds, «md contrary to genet al belief, most are native-horn, 
not foreign-born. 
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Over 5 million, of the 28 million who have non-English language liaekgrounds, ar2 
of school age, between the ages of six and nineteen — over ten percent of the persons 
in this age group. 

An estimated 2.4 million persons, aged four and older, do not speak English at all. 

Waggoner also reported the following mind-boggling gap between teacher need 
and teacher preparedness: 

In the public schools in the United States during the school year 1976-1977, out of 
an estimated one hundred and two thousand teachers (102,000) actively engaged 
in teaching English as a second language, only THREE out of TEN had taken even 
one course — one course — in teaching English as a second language. 

Peter Strevens' (1977a) observations are directly and disturbingly applicable 
here: **. . . when the standard of teachers is low, as it inevitably is for a time in 
conditions of rapid expansion of education, then the standards of the teaching 
materials assumes great importance. Insufficiently-trained teachers working with 
poor materials face a considerable handicap/' 

The serious state of affairs revealed by the NCES survey is a clear instance, 
indeed, over 70 thousand instances where the quality of the learning materials 
assumes critical importance. Finally, as the number of ESL learners increases 
and the need for more and better prepared ESL teachers intensifies, there is an 
additional feature of ESL population growth which must be taken into serious 
account — the special purpose English-language needs of students in an increas- 
ing variety of particular circumstances. These needs must be analyzed and ap- 
propriate learning objectives formulated in order that the most useful instruc- 
tional programs and learning materials can be designed. Clearly, the challenge 
to the ESL profession is a demanding one which continues to expand into new 
dimensions. Our responsibility is, as G. Richard Tuckers closing plenary state- 
ment charges (1977), '\ . to provide effective second language training to an 
increasingly larg^ and heterogeneous group of students". 

In relation to the topic of this plenary session then, the need to provide 
not just more, but high quality and increasingly specialized learning materials ( 
is a central responsibility of our profession and must become a priority concern. 

In many ESL learning/teaching settings around the world the written texts 
and recorded tapes exert a powerful and unquestioned force. This is especially 
true -whenever and wherever teachers of English as a second language have had 
neither broad study of language and language-learning, nor thorough training 
in ESL. Indeed, in the Unitf^l States alone, if we consider the serious impact of 
the one Waggoner figure (i.e. of 102 thousand teachers assigned to teach ESL, 
seven out of ten are totally untrained in ESL) then in the classrooms of over 
70 thousand teachers who have never taken even one course in ESL, the text- 
book may well be responsible for teaching the teacher as well as the student. 

2, Tht> "knowledge explosion*. During the past two decades there has been 
an extensive and wide-ranging increase in professional study relating to theory 
aid research in the social and behavioral sciences. Some have called it a 
"knowledge explosion". 

$r 21 
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Developments in psychology and linguistics have been of major importance 
to the field of second language learning and teaching, especially those in the 
emerging interdisciplinary fields of sociolinguistics; psycholinguistics, and Meuro- 
linguistics and particularly, studies in first language acquisition. 

Beginning in the late 1960's and continuing to the present time, there has 
been a smaller ^explosion of information, but a significant and respected one, 
related directly to theory and research in second language acquisition. Increas- 
ing numbers of conferences papers, journal articles and new books appearing 
on publishers' inventories are a measure of the emerging interest in this field of 
specialized study. 

A survey of TESOL convention programs reveals the.*growing prominence 
of second language acquisition study within our own organization. Compare: 
Ten years ago, in 1969, at the third convention of TESOL, (Chicago), out of 
nine pre-convention study groups, and 83 convention presentations, one papeiL 
dealt with a single facet of second language acquisition theory and ^search? 
Four years ago, in 1975, at the ninth annual convention, 'Los Angeles), one 
out of 36 pre-convention sessions and 22 out of 161 convention papers dealt 
with a variety of facets of second language acquisition. And in 1979, at the 
13th convention, (Boston), out of a total of approximately 300 pre-convention 
and convention presentations, over 50 address diverse topics u. an ever-widening 
range of studies in second language research. 

As theory and research in second language learning and teaching have 
opened up new vistas it has l^en tempting to look for immediate applications 
in the classroom. Indeed there is a potential two-way street of interchange be- 
tween theory and practice which will benefit both but observe that both direc- 
.ions are well-marked with signs for caution. One direction runs from develop- 
ments ir theory and research to changes in classroom instiuction; the other, 
reciprocally, from changes in classroom instruction to further developments in 
theory and research. But let the traveler beware! Research scholar* have been 
the first to stress the need for caution, lest the traversing of this two-way street 
be premature. 

Elaine Tannic. Merrill Swain, and Ann Fathman, (1976), discussed Some 
Limitations to the Classroom Applications of Current Second Language Acquisi- 
tion Research. They began with tins assessment: 

Research in second language acquisition has developed rapidly in recent years. 
With this rapid development has come an increasing desire on the part of some 
teachers and researchers ti> apply the result of research to the language clas.sroom, 
However, although progress has been made, the reality of the .situation is that sec- 
ond language acquisition research is still m an infancy stage, and hence cannot yet 
provide the classroom teacher with the kind of reliable guidelines needed to effect 
curriculum change, 



G. Richard Tucker (1077) leinforeed their call for Wit ion with these 
words: 
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I would argue stroogly (as do Tarone, Swain, and Fa thin an) that it is premature to 
consider revising syllabi or instructional approaches to reflect the results of these 
studies; although . . . there are certain broad areas of application for the results or 
second language research. 

Finally J. P. B, Allen, (1976), has made the following observation: 

. . . there is perhaps, something wrong with the- idea that the only way to apply' 
results of research 'a to write a whole new textbook or a brand new curriculum 
sequence. Perhaps it is better to see the current applications of research as com- 
prising an influent which indirectly and subtly changes the teacher's attitude to- 
wards what she or he is trying to do m the classroom — changing the teacher's at- 
« titude towards errors, for example, or leading the teacher to pay more attention to 

fonr»s the students are producing. Such a change in attitude may be the most im- 
portant application of current research which can be made to the field of language 
teaching. 

In relation to materials development then, it is essential for all ESL pro- 
fessionals to keep abreast of continuing developments in theory and research hi 
language, language learning, and language behavior. From an informed posi- 
tion knowledgeable judgments relating theory and practice can be made and 
potential implications and applications, direct or indirect, explored or exploited. 

3. The "instructional revolution- in ESL. A third development began in the 
late 1960*s and quickly gained both momentum and magnitude during the 
1970's. Called an ESL "revolution" by some it has brought about significant 
changes in the form and in the substance of ESL classroom practices. The 
emergence and rapid growth of this movement related primarily to two factors: 
(1) a growing disenchantment with traditional ESL methodologies and ma- 
terials, and with student achievement, and (2) an increasing demonstration of 
the failure of structural linguistics and behavioral psychology to account for 
crucial properties of language/language learning, and language behavior. 

A review of TESOL publications over the past twelve years clearly docu- 
ments the state and the stages of change. Forty-eight TESOl* Quarterlies, 62 
TESOL Newsletters, and five ON TESOL** chronicle problems and solutions, 
conflicts and resolutions, accusations and refutations, attacks and defenses, para- 
doxes and panaceas, dysarthrias and dyspepsias. 

As a profession, we have been engaged in re-evaluating and restructuring 
curricula, methods, materials. These endeavors have affected every single do- 
main of TESOL — English as a second dialect, EFL for foreign students in 
English-speaking countries, teaching English abroad, and ESL in elementary 
1 education, ESL in secondary education, ESL in higher education, ESL in adult 
education, and ESL in bilingual education. 

Significant features of the "instructional revolution" in ESL include the 
following: (J) a focus on the individuality of the learner, (2) a focus on the 
learner's language, (3) a focus on the special purpose needs of particular stu- 
dents, (4) a focus on genuine communication and (5) a focus on a humanistic 
classroom. Indeed, as Dubin (197H) noted in addressing the fifth of these 
features, 
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If authors of textbooks of the future are to respond tb all or the .significant changes 
taking place in language pedagogy, they cannot overlook the social climate in which 
learning takes place. In humanistic classrooms — and there are more and more of * 
i them — people take responsibility for their own learning. 

i Perhaps the extent of changes in ESL classroom practice can best be cap- 
tured in noting a recharacterization of the roles of teacher, learner, and learn- 
ing materials — new sets of expectancies and new responsibilities. 

3.1 Teachers: New insights have moved us to reject as untenable a concept 
I of the teacher role as primarily one of model and drill-leader. The teacher role 
I today emphasizes the importance of the teacher as a manager of learning ex- 
periences and a facilitator of learning. Teacher-training programs today and < 
in-service sessions on teaching resemble very little those of ten or c en five, 
years ago. 

3.2 Learners: Similarly, we no longer find tenable a concept of the student 
role as primarily that of repeater of forms and patterns. Today students aje 
recognized as the active creative prime-movers in their owrl learning process. 
In Corders (1976) words: "Efficient language teaching must work with, rather 
than against, natural processes, facilitate and expedite rather than impede 
learning. Teachers and leaching materials must adapt to the learner rather than 
vice versa.** 

3.3 Learning Materials: In ESI. classrooms the written textbook has always 
been a central force, It dictates the specific written sample of English to which 
a group of students is exposed, and it controls the specific spoken subset of 
English which the teacher a.id the students use as they the textbook and engage 
in activities related to it. Said another way, the written textbook determines, in 
large part, the nature of the language input which is made available for stu- 
dents It provides the language data from which students form and shape a 
view of target English. 

By-and-large the days have passed in which dependence upon one book 
(or set of book?) based on linguistic forms alone can be considered the most 
fruitful base for an efficient and effective job of facilitating second language 
learning. Today teachers* concerns about learning materials are influenced by 
changing assumptions about language, language learning, and learner processes. 
In particular, our view of the role of learning materials has expanded to include 
a central emphasis on communication and language function, as well as on 
language form. 

In relation to materials development then, note the impact of these im- 
portant changes in ESL concepts. First with classroom attention moving to a 
special emphasis on language function as well as on language form, finding 
an appropriate selection of texts becomes more difficult. Second, once selected, 
prepared textbooks almost always need to be adapted in order to provide for 
specific student needs. A* Madsen and Bowen (1978) observed: "Every teacher 
is an adapter of the text used. The more experienced and sensitive the teacher, 
normally the greater the adaptation." Third, in addition, texts often need to be 
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enriched with supplementary materials. Increasingly, teachers are expected to 
become their own part-time materials writers. This, in turn, places more pres- 
sure on college and university teacher-training programs fc> expand their course 
work in textbook selection and materials adaptation and to develop courses in 
classroom research and materials development. Fourth, with the demand for 
new texts on the increase, there is a more and more urgent need for creative, 
knowledgeable, experienced teachers to become full-time or at least part-time 
professional textbook writers. f 

In the larger picture, if the research and experimental work of materials 
development is to yield the highest quality product, more and more college 
and university departments and school systems, as well as government agencies, 
will need to do what a number already have clon^support their commitment 
to advancing the efficiency and effectiveness of second language learning and 
teaching with reasonable financial subsidy and related teacher time. 

3. Expansions in ESL 

Three significant developments in ESL have been discussed, the ESL popu- 
lation explosion, the knowledge explosion, and the instructional revolution in 
ESL, Working togetter, these three forces have contributed to a period of 
enormous expansion mhe teaching of English as a second language, an expan- 
sion which has been felt in every aspect of professional activity. 

We need look no further than the growth of our own organization to see 
the impact of expansion. As Executive Secretary James Alatis (1976a) has sum- 
marized, "\ . . once an organization gees beyond 5000 members, it is no longer 
a small organization, or even a medium-sized one. It is a large organization, 
and a powerful one " Today TESOL membership is over 8000; when TESOL 
began in 1967, the membership was 357. 

In the following section three important expansions of the last twelve 

years will be cited. 

1. Professional education. One expansion is the ways aud means by which 
we deal with professional education. In teacher preparation, in order to keep 
pace with mounting demands, college and university training programs have 
increased in number and have expanded the scope of course offerings and prac- 
tice teaching experiences. As teachers we feel an urgent need to keep up with 
the knowledge explosion and to monitor developments in the instructional 
revolution. To cope with this need, organizational forums have increased in 
number, in frequency, and in scope. We attend more and more meetings—large 
national and international conferences— and smaller ones under the auspices of 
national, regional, state, provincial, and city jurisdictions Some of our forums 
are more specialized, and some are more comprehensive than ever before. 
(Compare, for example, the TESOL convention programs of 1969 and 1979. 
In 1969 the convention index included the names of 142 participants; in 1979 
the convention index listed 877 names.) Similarly, in the area of in-service train- 
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ing, there is continual increase and diversification. In short, we get together 
more often, in more places, and have more to share with one another. 

2. Textbooks available. A second expansion is apparent in the textbook 
marketplace. In the past twelve years, publishers* inventories have doubled, 
and in some cases tripled. More textbooks are available today including a wider 
variety of specialized types of learning materials which did not even exist a 
few years ago. Vet there is a constant pressure for new texts arid materials, and 
hence a need for more teachers to make a part of their career option one of 
professional materials development. 

3. Teacher responsibilities. A third expansion involves the increased re- 
sponsibilities of ESL teachers. Despite our preoccupation with reassessments of 
classroom practices and despite emerging redefinitions of what language learn- 
ing may actually entail, unfortunately, unlike industry, we cannot stop our 
work while we retool for instructional change. It is the ESL teacher in the 
classroom who must institute giadual but continuous change, on the one hand, 
while carrying on the details of daily operation, on the other. 

As noted previously, even textbook selection is no longer necessarily an 
easy task. Further, once chqsen, the prepared text usually must be adapted and 
often supplemented. As Stevick (1971) has noted in the introduction to Adapt- 
ing ami Writing Language Lessons y " . . adaptation i.s inevitable; it ought 
therefore to ieceive more attention .and more prestige than it usually does." 

In :!iort, a 'large part of the responsibility for instructional change is borne 
by the ESL teacher who must become teacher/curriculum writer and teacher/ 
materials developer. For many teacheis, however, these are jobs which they 
do not feel altogether well-equipped to do without further course work and/or 
special in-service training programs. Without the assistance of specialized train- 
ing, many teachers who must assume the responsibilities of designing curriculum 
and developing materials, will continue to be frustrated by the disparity be- 
tween the training available and the requirements of their jobs. 

4. Specialized training in ESL materials development: 
a professional responsibility 

Significant developments and expansions in ESL have been reviewed and 
related to materials development. In this section a call will be made for ex- 
panding professional training in aspects of evaluation adaptation and research/ 
development of ESL instructional materials— an increasingly critical area of 
ESL responsibility and one which must become a priority concern. Specific 
suggestions for expanding this component of ESL professional education will 
be outlined. 

1. Professional planning for materials development. As a profession, what 
can we do that we are not now doing to best prepare to meet the following 
needs. (1) a growing demand upon ESL classroom teachers for more and more 
materials adaptation and supplementation, (2) an increasing need for more, 
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highest quality, both general and increasingly specialized textbooks and other 
learning materials. 

Clearly, we are moving rapidly into new frontiers in inateMls development. 
The new 'fron*iers are here and now, and they need the most thoughtful' of 
explorations. There is a gap between theory and practice. But .the gap will 
not be narrowed by fads, nor by putting new names to old theoretical and 
pedagogical friends, nor by juftming u.t bandwagons of premature attempts to 
translate theory into practice. L£ ^ than .well-informed and wejl-formlildted 
attempts to ,do :,o can only contribute to disappointments, to instability, and 
to an illusion of bridging the gap . . . which under close scrutiny is more 
apparent trien real. , 

- Materials development is a creative process, one which is exciting, chal- 
lenging, time-consuming, Exacting, and often frustrating. There a built-in 
need for experimental trial-and-error, and no matter how careful ..ie planning, 
some developments do occur by chance. But the highest quality, most efficient, 
and most effective learning materials are^seFom the product of chance. They 
are the product of design. And quality d'.ajfn is the* product, of three funda- 
mentals: (1) sound knowledge of theoretical constructs relative to the^ project, 
(2) sound knowledge of pedagogical constructs relative to the project, (3) 
experience with the project fn ESL classrooms (continuing throughout every 
interim phase of' development). 

Further, as design is translated into substance, it is the responsibility of 
the materials developer to maintain a continuous monitoring of issues relating 
to language and language learning processes, to the language learner, and to' 
the language teacher. 

Again then, as a profession, " . . what can we do that we are not now doing 
to best prepare to meet the present and future needs in materials development 
in ESL? The answer necessarily lies in yet another expansion i . profe* -ional 
education, in career preparation for future ESL teachers, in continuing educa- 
tion, in education for chan^t ind in ourselves. In the three Subsequent sections 
suggestions will be outlined for expanding specialized training in evaluating and 
adapting instructional materials and in researching and developing materials; 
in teacher training programs, in in-service training programs, and in convention 
programming. \ 

2.- Teacher training programs. Suggested Curriculum Expansion: An in- 
crease in specialized course offerings in ESL materials development with & 
wo (or trtfee) course sequence which provides for a balance of :>f 1 ) theoretics 
foundations and the form and function of learning materials; (2) practical ex- 
perience in evaluating, adapting, researching, and developing materials, 
(bourse content possibilities include th<* following: 

(1) a chronological survey oc theory in second language learning and 
teaching; special attention to influences on curriculum design and 
the form and function of instructional materials. 
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(2) f a review of the hist^ and development of ESL learning/teaching 
materials; special attention to the needs of intended student audiences 
and to a study of geographical use patterns. 

(i>) individual and/or group projects *i criterion-referenced evaluations 
of selected ESL textbooks (and other learning materials); special 
attention to goals, intended student audience, student involvement, 
content, sequencing, presentation format(s), measures for assessing 
teaming, and teacher involvement, as they reflect theoretical/pe og- 
M ical considerations. 

(4) individual and/or group projects in assessing the needs of a particular 
group of students, selecting a set of appropriate learning materials, 
outlining intended use schedule, and projecting possible adaptations 
and supplementations. 

(5) specific* individual and/or group projects in researching, designing, 
and developing a (limited) set of instructional materials. 

3. In-service training progfoms. Suggested Program Expansion: An in- 
crease* in specialized in-service training programs devoted to ESL materials 
evaluation/adaptation and research/development: 

(1) - under tlu individual or joint sponsorship of public or private school 

systems, community colleges, adult education programs, public or 
private institute programs and/or government programs, and 

(2) focused on tht relevant materials development needs of one or more 
(closely related) areas of TESOL: English as a second dialect, EFL 
for foreign students in English-speaking countries, teaching English 
abroad, ESL in elementary education, ESL in secondary education, 
ESL in higher education, ESL in adult education, and ESL in bilin- 
gual education. 

Format and content possibilities (with attention to both theory and practical 
experience) include the following: 

(1) mini-courses in reviewing recent developments in theory in ESL 
learning and teaching; special attention to implications for curriculum 
design and materials development for ESL students in specific educa- 
tional settings. 

(2) study groups in evaluating ESL materials; special attention to goals, 
intended student audiaice, student involvement, content, sequencing, 
presentation format(s), measures for assessing learning, and teacher 
friWvement, as they reflect theoretical/ pedagogical considerations. 

f (3) study groups in assessing student needs and -developing guidelines 
for selecting appropriate learning materials ahd outlining adaptations 
for a specific student group. 
(4) demonstrations and workshops devoted to adapting and supplement- 
ing selected materials to meet the needs of a specific student group. 
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(5) workshops in researching, designing, and developing instructional 
materials for a specific student group. 

4 Convention programming. In the past few years both national and 
affiliate TESOL convention programs have included a growing component of 
demonstrations and workshops devoted to practical aspects of materials develop- 
ment. The enthusiastic 'reception of these sessions underscores the need for 
continuation of convention programming devoted to these topics. 

More recently, with the introduction of mini-courses at TESOL 77, there 
is now a format for a longer inte: t sive study session in which non- specialists in 
an area may benefit from the equivalent of a "short course" taught by a 
specialist. In future mini-courses, theoretical considerations as well as practical 
aspects of materials development topics can and should be explored. A variety 
of annual up-to-date short courses can provide ESL professionals with oppor- 
tunities for continuing education, (and education for change), relating theory 
in second language learning and teaching to implications for curriculum design 
and materials development. 

Finally, future convention programming could well include colloquia in 
which specialists in materials development devote time and discussion to 
theoretical and pedagogical issues relating to researching, designing, and de- 
veloping instructional materials for studenb in a variety of particular circum- 
stances. 

There should be an increase in convention sessions devoted to both theo- 
retical and pedagogical aspects of ESL materials development— papers, demon- 
strations, workshops, mini-courses, and colloquia. 

5. Concluding notes 

ESL has been undergoing substantial changes and expansions for many 
years. Changing concepts about the nature of second language learning and 
learners have influenced language pedagogy and are beginning to b* reflected 
in curriculum design and in the form and function of instructional materials. 

There is a serious need for careful profession-wide planning for specialized 
training in ESL materials development if we are to meet the demands of emerg- 
ing needs and the challenges of new dimensions in instructional materials. 
Recent developments in the study of language, language learning, and learner 
processes should be reflected in new materials but thoughtfully and knowl- 
edgeably so that disappointments and disenchantment:; are minimized so that 
the new frontiers in materials development are explored r ul!y not by 
chance, but by design. 
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Bilingual Education: Research and its Implications 

M«rrtll Swain 

The OntarS^ Institute for Studies in Education 

Bilingual education, has different meanings to different people. By bilingual 
education I mean the use of two languages as mediums of instruction at some 
stage in a student's educational career. This allows for wide variation with 
respect to- when the languages are used tthin any grade level or across grade 
levels. Thus, included ar bilingual "educational, are programs such as French 
immersion programs where English-speaking unilingual students are initially 
instructed in French and do not receive instruction in, or about, their first 
language until the third or fourth year of schooling, as well as programs for 
unilingual Chinese students who are taught initially in their mother tongue and 
introduced to instruction in, or about, English simultaneously or at later grad«* 
levels. Bilingual education ,does not, and should not, exclude the teaching of 
either language as language per se, but it necessarily involves using both lan- 
guages as vehicles of instruction. 

The sheer volume of words that have been devoted to the topic of bilingual 
education is, to say the least, overwhelming. One can read about bilingual 
education from a linguistic perspective, an educational perspective, a philosoph- 
ical, psychological, sociological, political, historical or a legal perspective (see, 
for example Spolsky and Cooper, 1977; Centre for Applied Linguistics, 1977). 
One can find discussions and descriptions of bilingual education in multilingual 
societies in both developed and developing countries, of bilingual education for 
immigrant groups, for indigenous populations, for minority language groups, 
as well as for majority language groups ( see, for example, Lord & Tsou, 1976; 
Spolsky, 1972; Swain and Bruck, 1978). And one can read about research and 
evaluation studies assoeiat?d with bilingual education (see. for example Lam- 
bert and Tucker, 1972, Cohen, 1976; Mackey, 1972). 

However, attempting to jcome to grips with all the literature, and the 
contradictory conclusions reached in the various research and evaluation studies, 
quite simply, boggles the mind! Consider some of the research evidence, for 
example'. On the one hand, evidence has been presented which suggests that 
bilingual education leads to enhanced cognitive development, high levels of 
achievement in content learning, high level: of proficiency in second language 
skills, . enrichment of first language skills, increased feelings of self-worth, as 
well as more positive attitudes towards schooling and towards other ethnic 
groups. On the other hand, evidence whidi supports virtually ,,the opposite 
conclusions has also been presented. 
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Can these differences be reconciled? I believe they can. Furthermore' I 
think they must be,, if our concern is with the provision of educational programs 
which maximize a child's opportunity for learning and personal growth. What 
I want to do, then, is to try to provide a basis /for understanding the con- 
tradidlory conclusions associated with bilingual education. In order to do this 
we wiU need to consider differences in programs, differences in the children 
attending the programs, differences in the* communities in which the programs 
operate, and differences in the research strategies employed in the studies 
themselves, 

As^one outcome of this exercise, several underlying concepts which in 
part account for the contradictory results emerge. And it is with a brief discus- 
sion of these concepts that I will conclude. - 

So let me turn now to examine some contradictory evidence. In doing so, 
it is relevant to note that although studies which have examined the effects of 
bilingual education have measured' a variety of student outcomes, they have 
all been concerned with at least two questions, and these are the questions 
we will consider here. First, how do students in a bilingual program perform 
academically relative to similar students who are not participating in the 
program''- Second, how do the linguistic skills of students in a bilingual educa- 
tion program compare with those who are not participating in the program? 

The results that I will refer to come mainly from two different sources: 
the Canadian data on early total French immersion education pnd the American 
data gathered by the American Institutes for Research (Danoff, M. et al. 1978). 
The Canadian data have been collected by a number of researchers over th'#> 
last decade or so, each working in their own ]gcal area (see Swain, 1976 for 
a bibliography)'. The American Institutes for Research (AIR) study, a study 
* commissioned by the US Office of Planning, Budgeting and Evaluation, ex- 
amined the impact on. Title VII students in Spanish-English programs. To do 
this a large nation-wide representative sample of Title VII projects was selected 
and compared to similar students hi non-Tit*r*VlI programs. The major part 
of the study was conducted in the 1975-76 school year and included students 
from grades 2 to 6. 1 The programs from which the students were selected were 
in their 4th or 5th year of operation. 

With respect to the first question, how do students in a bilingual program 
perform academically relative to students not participating in the program, 
studies of French immersion programs in Canada show that students in the 
French immersion program obtain scores equivalent to their English-educated 
counterparts on standardized tests of Science and Mathematics, typically scoring 
at or above their expected grade level in relation to national norms. On the 
other hand, the AIR study showed that Title VII students either performed 
similarly or not as well as non-Title VII students, and both groups scored below 
their expected g r ade level in relation to national norms in Mathematics. The 

1 In the fall of 1976, a sample of classrooms from tne grades 2 and 3 cohorts was tested 
again when the students were* in grades 3 and 4. 
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expectation, of course, was that the Title VII students would do better than 
non-Title VII students. 

With respect to the second question, how do the linguistic skills of students 
in a bilingual program compare with those who are not participating in the 
program, stilus of French immersion students in Canada show that in relation 
to their first language, there is ah initial lag in English language skills on the 
immersion students relative to their English-educated counterparts. However 
differences between the groups disappear by no later than grade four. In I^er 
grades, immersion students in some cases out-perform their English educated 
counterparts in various aspects of English language skills, and tend to stand 
at or above their expected grade level in terms of national norms. It is interest- 
ing—and we will return to this point later— trjat little attempt has been made 
to measure first language proficiency in the U.S. programs. However, in the AIR 
evaluation of Title VII programs just mentioned, Spanish reading was measured, 
and no differences were found between the scones of the students in the Title 
VII programs and similar students not participating in Title VII programs. The 
expectation was of course, that students in the bilingual program would do 
better. 

In relation to second language skills/ the Canadian French immersion 
results show that in reading and listening native-like levels are attained, but 
not in speaking and writing. On a Frf ch achievement test standardized on a 
Quebec francophone population, immersion students score around or below 
the 50th percentile; whereas, the AIR results showed that on a standardized 
Jest of English reading and vocabulary knowledge the Title VII students score 
around the 20th percentile in grades 2 through 8. 

I have taken extremes to illustrate how different the results can be. There 
are, indeed, other studies which show equally positive or negative results 
The issue being raised 'here, however, is 'what possible factors might account 
for the differences in results. How, in fact, can these differences be reconciled? 

First, there is the issue of how the data are interpreted with respect to 
expectations. Second, there are important differences in the strategies employed 
in carrying out the studies themselves that may account for the different results. 
And third, there are a variety of background, student and program variables 
that mav also account for the differences, These will be examined in turn. 

First, it is, important to note that the goals of the two programs are differ- 
ent. In the French immersion program, the goal is to learn a second language 
without a decrease in native language skills or academic performance. In the 
Title VII programs, the goal is to learn a second Iang»;age and increase academic 
performance. The role of the mother torigue is seei, as a bridge so that stu- 
dents are not prevented from learning content, material while the second lan- 
guage is being learned. Thus, the results just mentioned are interpreted as posi- 
tive in the case of the Canadian programs because the goals have oeen at- 
tained However, in the U.S. the results are interpreted negatively because 
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the expectations have not been met. However it is important to note, as"Eappert 
and Cruz (1977) have done; and I quote them: 

a non significant effect is not a negative finding with respect to bilingual education. 
A non-significant effect, that students in bilingual education clashes are learning at 
the same rate as students in monolingual classes, demonstrates the fact that learning 
in two languages does not interefere with a student's academic and cognitive per- 
formance. Students in bilingual classes have the added advantage of learning a sec- 
ond language and culture without impeding their educational progress. Under these 
circumstances, a non-significant finding can be, interpreted as a positive effect of 
bilingual educ *3R« 

Let us consider next, the nature of the differences between the two sets of 
studies. Although the AIR study evaluated student performance frorrl grades 
two to six, it did so by looking at each grade level independently and examined 
achievement gains over approximately a six month period within each grade 
level. 2 Although most of us may claim to be optimistic^about the effects of 
education, even the most optimistic amongst us may consider that six months 
is a rather short period of time to expect significant changes to occur in one 
group relative to another, especially when one considers that the six-month 
period is not the only "input" time. 3 That is to say, gains may have already 
been made by the students during previous years in the program, and these 
previous gains would not be evident in a comparison of gains over a six month 
period. 

In this respect, the studies on immersion education have been considerably 
different. The studies have, followed students for considerably longer periods 
of time, in most cases from kindergarten year to their current grade level, thus 
^ allowing for the, cumulative effect of the program to be determined. 

The problem of looking for significant differences in gains over short 
periods of time in the AIR study is compounded by the fact that very little 
was known about the previous educational experiences of the students, or their 
linguistic or attitudinal characteristics at the onset of their experience with 
bilingual education. We might ask in what ways were the Title VII students 
different from those who did not enter a bilingual education program? Pre- 
sumably they were having difficulties with Engluh, yet the results show that 
by grade 5 they were making gains equivalent to non-Title VII students, and 
were in fact scoring comparably on standardized tests of English. 

Many of the students in the study had been placed into the bilingual 
program part way through their elementary education. For example, the average 
number of years previously spent in a bilingual education program for the 
grade 6 students in the AIR study at the time of testing was approximately 
three. What was the effect of the switch? Perhaps their parents did not want 

2 It should be noted that there was a full year between testings of the grades 2 and 3 
cohorts mentioned in footnote 1, with approximately similar results, 

' 1 For (he 1975-76 sample, about 75* of the students in grades 2 to 6 had two or more 
* years of bilingual education, and 50* of the students in grades 3 to 6 had three of more 
yean. 
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them in a bilingual education program because they feared their child might 
^ not learn English in a bilingual class? Perhaps switching was perceived by the 
child as a sign of failure in school? Perhaps the Spanish taught and used in 
class was not their own dialect of Spanish only to reinforce already existing 
feelings of inferiority? 4 

And what about the students who were in a bilingual 'education progr&n 
in previous years and have since moved to non-Title VII programs because 
they have been judged to have sufficient knowledge of English to cope with 
instruction in English? Aside from the fact that their absence would have the 
effect of lowering the English scores attained by Title VII students, shouldn't 
they also "nave been studied as part of the impact of Title VII? 5 

In the Canadian immersion studies, on the other hand, the initial character- 
istics of the students are known. They are unilinguar English-speaking children. 
They come from middle to upper middle class homes, and their parents have 
chosen to enroll thefn in the program. Students do not leave the program when 
they have sufficient knowledge of the second language, but rather they remain 
in the program continuing to study in both their first and second languages. 
There is, however, attrition, and although it has not been studied systematically 
*yet* it is an important question to ask why some students leave the program. 
In some cases it may be because the program was perceived by their parents 
* * or teachers to be too difficult for them, although recent data from Alberta 
suggests that moving was the main source of attrition. However, just as the 
'"drop-outs" of Title VII programs might perhaps beP viewed as a sign of the 
success of the program, perhaps the "drop-outs" of French immersion programs 
might be viewed as a sign of its failure. 
w The point being made here is that. without knowing what th6 initial char- 

acteristics *of the students were, and the cumulative effects of the program on 
the students, it is impossible to conclude that the results are either negative 
or positive. Interpretation of the results as either positive or negative is de- 
pendent on knowing wljat the children were like at the beginning of the 
program, and observing the effects over titye of the program (see also, Cummins, 



Another difference between the AIR and Canadian studies is that of lump- 

4 D. Reynolds has since pointed out to me "that these speculations are not backed up by 
any evidence that the students were indeed switching into and out of the Title VI 1 project 
classrooms for the reasons you suggest If we are to conjecture^ about reasons for sixth graders* 
being in dm program for fewer than six years, we might suggest that the students came in 
late because they recently moved into a district with a Title Vll program, because they 
recently arrived in the United States, or l>ecause slots in the classroom opened up when the 
students originally enrolled moved out of th<? school district. (There is a >.ery high mobility 
among the student populations in inc»t of the districts with Title VII programs) (personal 
communication, May 11, 1979). 1 see no disagreement beUveen reasons Ave suggests 
for why the children were switched, and what 1 am siggcting — the possible psychological 
effect of the switch. 

:> li should be noted, however, that "AIR found altnosr no programs consciously moving 
students out of the bilingual classrooms once they had learned English. No e*it criteria were 
in effect, and 85% of the projects studied considered themselves native language maintenance 
programs" (D. Reynolds, personal communication, May 11, 1979). 
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ing together the results across different program variations. This simply ha3 
not' been done in Canada. Comparison of results across different program 
variations have been made on an ad hoc basis (Swain, 1978), and generaliza- 
tions from similar programs ongoing in different communities have been drawn 
(Swain, 1979). The AIR study has taken the opposite tact, and averaged results 
across communities and program formats. Given also the heterogeniety among 
students in terms of their abilities in each language, their language use patterns, 
their attitudes toward school and the use of each language from program to 
program, as well as within programs, the results are basically useless in provid- 
ing information that would be helpful to program planners. Much more useful 
would have been to analyze results according to known differences in student 
and community characteristics ai.d program treatment, examining the individual 
results to determine if any generalizations could be drawn, even if tentative. 

In effect the AIR study violates the very common sense point that to 
determine what kinds of programs work best with what kinds of children 
under what circumstances, one needs to look at the effects of individual pro- 
grams and community variables as they interact with initial student character- 
istics over a long enough period that cumulative effects of the program can 
be observed. Most likely some of the programs were effective at achieving 
their aims, and some were not. What would be interesting and useful to do is 
determine the characteristics of the successful and unsuccessful programs. 6 

Finally in terms of research methodology it is worth mentioning a problem 
that plagues both sets of studies: the instruments used to assess student lan- 
guage abilities and achievement levels. Here, the best that can be said at the 
moment is that we need to develop much more sophisticated tools for the 
assessment of linguistic abilities. We must address the issue too, of language 
for what purpose? — for communication? as an instrument of thought? for self- 
reflection? for artistic and literary purposes? for what? 

What do current language tests measure? Most of the tests used measure 
vocabulary knowledge, reading and grammatical knowledge. Grammatical 
knowledge would appear to be a necessary although not sufficient condition for 

6 It is the case that in the AIR study, relationships were examined between student 
achievement and educational processes (e.g. extent of grouping versus individualized instruc- 
tion, hours devoted to ESL and 'SSL instruction, use of languages in class), teacher and 
teacher aide characteristics (e.g. level of education, teaching credentials, years teaching, 
training in bilingual education), and contextual variables (e.g. proportion of Hispanic stu- 
dents in class, home and neighborhood language use of English) for a subsample of the 
programs. Although several interesting overall patterns emerge, the problem remains that the 
analyses lumped together all the programs rather than analyzing individual programs that 
were successful or unsuccessful and relating these outcomes to the above mentioned variables. 
And. unfortunately, the analyses used, partialied out the effect of several of the factors which 
are being suggested here as key factors, e.g. preference for English versus Spanish, and the use 
of Spanish and English. But indeed, the authors are to be commended for collecting and 
analyzing such significant data. The major suggestion is that further analyses on a class-by- 
class basis would he more revealing if the aim is to understand what types of educational 
treatments are most successful for students of different linguistic and attitudinal characteristics 
in communities of different linguistic and attitudinal characteristics, given the program ^goals. 
In all fairness to the AIR study, however, it must Ik stated that its aim was not to evaluate 
any specific approach to bilingual education, but rather to evaluate the overall impact of the 
Title VII program. 
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communicative performance, but clearly, tests of grammatics! knowledge cannot 
substitute for tests of communicative performance. Then we must address the 
issue of what constitutes minimal levels of commun7cative performance if that 
is to define limited English (French, Spanish) speaking abilities. And if it is 
to define the necessary minimal level, then it is essential to consider the rela- 
tionships between this minimum level of communicative performance and the 
psycholinguistic concept of threshold level as proposed by Cummins (1978a; 
1979a) and which will be discussed shortly. 

As well as differences in approaches to the evaluation of programs, there 
are many differences both in the characteristics of the students involved in the 
Title VII programs and those t>f the students involved in immersion education. 
Many of these have already hem mentioned in the preceding discussion, but 
it is important to note the considerable heterogeneity both across and witftin 
programs of student characteristics in the U.S. programs/especially as relates 
to their first and second language abilities at the onset of the program, and 
the students perception of their participation in the program. The common 
characteristic among students participating in the Title #11 pr6grams as stip- 
ulated by thf previous legislation was that they were from lower class faipilies. 
The immersion students, on the other hand, form a much more homogenous 
group and for the most part, come from middle to upper-middle class homes. 

Similarly there are differences in communities — in community attitudes 
towards the use of the languages involved and towards the program format. 
Where immersion programs exist, it is usually because the community has 
fought for it, often against administrator resistance. In effect, the programs have 
been tailored to meet the community demands. For example, in Ottawa and 
MontreaJI where the need for French is perceived as greater than in, for example, 
Vancouver, more instructional time in French has been demanded. In the U.S. 
programs, it has often been the case that program formats have also been 
developed to meet community demands. However, it has also been the case 
that in some, communities the programs have been im t osed by legal mandates 
on communities which are more concerned with having their children learn 
English than maintain their mother tongue. To them it is anti-intuitional that 
bilingual education can provide both. 

And finally there are program differences. The typical format of a French 
immersion program is that all instruction is given in French, the second lan- 
guage, until grade 2 or 3. Only then is English Language Arts introd ced and 
taught in English for approximately an hour a day. With each successive year 
thereafter a larger proportion of the curriculum is taught in English until an 
approximately equal balance is reached between the time devoted to instruc- 
tion in the mother tongue and the second language. 

The first French immersion program in the public sector began in the 
Montreal area some 14 years agu. Since then other communities have pushed 
for an immersion program to be established, due in large part to the reported 
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success of the original program. Each program has begun at the kindergarten 
level and each year added a successive grade. Thus, the growth both within 
programs and across Canada, has been slow and steady. Millions of dollars 
have been spent on the development of curriculum materials, and on evalua- 
tions of individual programs by a variety of researchers. The data collected 
have been fed back to the program developers, leading at times to changes in 
instructional emphasis and instructional techniques. 

The programs in the U.S. are as different as are the students who enter 
them and the communities they are housed in. The extent to which eaph lan- 
guage is used, by whom, for what purposes, to teach what, vary from program 
to program. In addition many programs were implemented before materials 
were prepared or teachers were trained. 

As I have already mentioned, there are differences in the goals of the_ 
programs. Title VII is trying to accomplish several things at once: to increase 
the level of academic achievement, to strengthen self-image and culture identity, 
and to develop second language skiUs. In the immersion program, the main aim 
is to develop second language skills without a decrease in academic achieve- 
ment or mother tongue development' In the immersion program, nc one is 
concerned with improving the child's self-image or cultural identity, it is not 
judged to be low in the first place. 

Let us consider for a moment a common goal — that of learning a second 
%|*nc;uage. Here I think the Canadian research has an important message to 
offer and that is, that in spite of the fact that immersion students have every- 
thing going for them" — that is, they *>me from middle to upper-middle class 
homes, they have parents who are t ed in their education and motivated 
to have their children learn French, they have native French-speaking teachers 
and so on — they still have not mastered the spoken and written codes with 
anything like native proficiency. In spite of their lack of native-like proficiency 
in speaking and writing, the immersion students have developed sufficient levels 
of understanding and reading to be able to deal effectively with the material 
presented to them in the second language. This suggests that limiting the de- 
finition of those who need English to profit from instruction in that language to 
speaking abilities only is perhaps misguided. In light of this, it is a significant 
step that has been taken by the U.S. Federal legislation to open the Title VII 
program to those with limited English proficiency in all four skills — understand- 
ing, speaking, reading and writing. As I have already mentioned this means 
that considerable efforts will be necessary to establish what is to be meant by 
minimal levels of proficiency. 

Cummins (1978a, 1979a) has referred to the level which must be attained 
in a language in order to profit from instruction in that language as the threshold 
level. Cummins has also suggested that the development of second language 
competence 7 is a function of the level of the child's first language competence 

7 For Cummins, language competent mrans "the ability to make effective use of the 
cognitive functions of language, that is to use language effectively as ,tn instrument of 
thought and represent cognitive operations by means of language" (Cummins, 1978 foot- 
O me 21). 
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at the time when intensive exposure to the second language begir . This implies 
that because the level of competence in the second language is dependent on 
the level attained in the first language, it is important to ensure that the thresh- 
old level is also attained in the first language. There is evidence to support this 
notion from several studies where initial teaching in the mother tongue plus 
second language classes has led to higher levels of linguistic performance in 
both the first and second languages relative to those students receiving initial 
instruction only in the second language. The studies include Finnish immigrants 
in Sweden (Skutnabb-Kangas and Toukomaa, 1979), francophone minority 
* groups in Manitoba (Hebert et al., '.976) and Alberta (Cummins, 1979b), and 
Navajo children in Rock Point (Rosier and Farella, 1976). This points to the 
critical importance of determining the linguistic proficiency of the students in 
their first language, something which has rarely been done in the U.S, programs 
because the goals have been oriented to second language learning, But if the 
potential level to be developed in the second language is dependent on the level 
attained in the first language, then information about the first language abilities 
of the children will be important in determining appropriate forms of educa- 
tional intervention. 

^The differences in results, then, can be attributed to a number of factors. 
They are rather nicely summed up by the notions of subtractive and additive 
bilingualism as proposed by Wallace Lambert (1975). Subtractive bilingualism 
refers to situations where the "learning of the second language reflects the loss, 
or 'poor" learning of the first language. Additive bilingualism refers to situations 
where the learning of a second language occurs with no loss to the first language. 

For example, in the French immersion program, the children are mem- 
bers cf the dominant linguistic and cultural group. Learning the second lan- 
guage does not portend the gradual replacement of the first language and the 
loss of cultural identity associated with that language. Furthermore the second 
language being learned is a socially relevant, nationally and internationally 
recognized language, through which individual economic advantages may ac- 
crue to the learner. This situation, that is, where the first language is maintained 
while a second language is being learned, is an "additive" form of bilingualism. 
This is in contrast to the situation faced by many immigrant groups, minority 
groups and indigenous populations who perceive knowledge of the majority 
school language to be the gateway fc» social and economic gains, and the home 
language to be of little consequence except in enabling them to communicate 
with their friends and relatives, and preserve ethnic identity. The overwhelming 
use of the dominant language in school and in the wider community often re- 
sults in a "subtractive * form of bilingualism where the learning of the second 
language may reflect some degree of loss of the first language and culture. 

Combining the ideas of Lambert and Cummins, that is, that the potential 
for linguistic development in the second language is dependent on the level 
attained in the first language, and that subtractive bilingualism mav indicate 
lower levels of attainment in the first language — suggests a major implication 
for education. The implication is that if optimal development of minority lan- 
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guage children's academic and linguistic potential is a goal, then the school 
program must aim to promote an additive form of hilingualism. Attainment of 
this goal will necessarily involve a home-school language, switch at some stage 
in the educational process, but when, and how much, must be determined in 
relation to the linguistic and socio-economic characteristics of the learner and 
of the learning environment. Specifically when the home language is different 
from the school language and the home language tends to be denigrated by 
others and selves, and where the children come from socio-economically de- . 
prived homes* it would appear appropriate to begin initial instruction in the 
child's first language with the second language being introduced as a subject 
of instruction. At a later stage instruction in the second language would then 
be introduced. On the other hand, where the home language is a majority lan- 
guage valued by ihe community, and where literacy is encouraged in the home, 
then the most efficient means of promoting an additive form of bilingualism is 
to provide initial instruction in the second language (Swain and Cummins, 
1979). 

These are general statements which now need to be refined by examining 
specific research hypotheses. Specific hypotheses can be generated which take 
into account the level of first and second language competence of the learner 
at the onset of the program, the particular soeio-cultural-political characteristics 
of the environment, and the characteristics of the program itself in determining 
linguistic, academic and psychological outcomes. 

Research has much to offer if we take as our starting point the question: 
which kinds of childr <ue going to do best in which kind of program in what 
kinds of communitie king the question this way recognizes that there are 
many possible forn of bilingual education, that communities may differ 
radically, and that children vary considerably in the characteristics they bring 
with them to school. Student outcomes must be seen as resulting from commu- 
nity and program variables as they interact with student characteristics. 

In summary, the following are the points that I have tried to make; 

(1) Different formats of bilingual education are appropriate for different 
students. The program will interact in important ways with, among other things, 
the linguistic characteristics the children bring with them to school and com- 
munity attitudes towards the languages. Research must direct itself to the isola- 
tion of other key variables. 

(2) Negative results related to bilingual education tend to be associated 
with situations fostering subtractive forms of bilingualism. The challenge facing 
educators is how to turn subtractive forms of bilingualism into additive ones. 
This has been done in some instances, for example, with Finnish immigrants in 
SwedenJIminority francophone groups in Alberta and Manitoba, and indigenous 
populations in the U.S. 

(3) Slow, steady growth of programs, with well-planned curriculum, and 
strong community and administrative support is bound to lead to more effec- 
tive results than "'shot-gun" implementation. 
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(4) In trying to interpret the results of research associated with bilingual 
education, ask yourself the following questions: 

(a) What are the characteristics, especially linguistic, academic and attitudinal, 
of the students before they entered^the program? 

(b) What arc the attitudinal and linguistic characteristics of the community 
in relation to the program? 

(c) What is the nature of the educational treatment especially with regard to 
the use of the two languages, both as taught and as used as mediums of in- 
struction? 

(d) For how long were the students followed? Are cumulative effects of the 
program being demonstrated? 

(e) How is the child experiencing the program, regardless of the program goals? 

(f ) What was the nature of the tests used? What were they measuring? In what 
languages were they given, the child's first or second language, the child's 
dominant or weaker language? 

(5) And finally, in relation to the issue of tests, it is obvious that consider- 
able effort is going to be needed to determine what constitutes minimal levels 
of language proficiency, perhaps most productively pursued by asking what 
are the threshold levels, that is what does the student need to know linguistic v 
in relation to reading, writing, speaking and understanding in order to be auie 
to profit from instruction in that language? 

In conclusion, there is one other point I would like to make. Although I 
have focussed on outlining some of the reasons why we might find different 
results in different studies of bilingual education, it is important to note that 
there is evidence which demonstrates that some bilingual programs have ac- 
complished their aims (see Troike, 1978, for example). It is unlikely that it 
will ever be possible to determine whether or not the fact that the program is 
bilingual is the only reason for its success. However, the fact that it has been 
demonstrated that through bilingual education it is possible to learn a second 
language and enhance learning at no expense to the mother tongue for minority, 
immigrant, and indigenous populations is extremely important. It means it can 
be done! It is possible through the efforts of dedicated teachers and committed 
communities to change subtractive bilingualism into an additive form -of bi- 
lingualism. If we are committed to developing to its fullest potential the lin- 
guistic, academic and personal growth of the child through education, then 
bilingual education can provide the means to do so. 
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Testing Panel 

What Makes Test A Four Letter Word?: Viewpoints 
on Bias in Standardized Testing 

Dorlene Lonon 

» r 

New York University , 

Classroom teachers have repeatedly seen, evidence of ample ability in their 
students from lr listically deferent cultures, yet watched low scores accumulate 
on their records <> whenever these students were tested for achievement and 
ability. They have come to TESOL's Committee on Sociopolitical Concerns year 
after year to beg that we "do something" about this inequity. The time was 
long overdue, and *with the cooperation of the Second Vice President Carlos 
Yorio, the Committee on Sociopolitical Concerns was able to turn the attention 
of TESOL 79 participants to the issue of bias in standardized testing. 

Opportunities for bias in testing are unending. Cilberto Garcia, then of 
Georgetown University, ca ied to our attention at TESOL 78 the many cases 
of bias in the routines and procedures at the testing site, and in the scoring and 
the way the' results were used. Both in the judgment of individuals records, and 
in the guidelines and policies of federal and state education agencies, we must 
be alert to the implications of every regulation. 

Panelists. It 'TESOL 79 whose remarks fallow are: Dr. Ernest M. Bernal, 
Jr. of the Educational Testing Service in Austin, Texas, Dr. John W. Oiler, Jr. 
of the University of New Mexico, Dr. Paul Rosier of the Page, Arizona Public 
Schools and Dr. Roger Wt Shuy of Georgetown University. The remarks of our 
fifth panelist, Dr. Edward DeAvila of Stanford University were not available 
for inclusion. Dr. DeAvila's views can b* found, however, in several articles 
suA as, "Piagetian Alternative to IQ: Mexican-American Study" by Edward 
i/eAvila and Barbara E. Havassy in Issues in the Classification of Exceptional 
Children, Nicholas Hobbs, Editor, Jossey-Bass Publishers, San Francisco, 1975. 

We in TESOL are eager to see more attention paid to finding new kinds of 
assessment instruments and assessment policies which will maximize die op- 
portunities for success for every student, pre-kindergarten through adult, for 
native speakers, non-native speakers, and for those who function in standard 
English as a second dialect. 

I wish to acknowledge my debt to an earlier article by John Upshur, Test 
U $ Four Letter Word," fqr the genesis of my title. 
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What Makes Language Test an Expletive: Tests^ 
Test Selection, and Test Use 

Emtst M. Btmal, Jr. 

Educational Testing S /ice 

1. Ttsts 

The expanding fields of bilingual education and ESL have created a con- 
siderable demand for language assessment instruments for many age levels and 
different educational settings. Many tests, of course, employ the more tradi- 
tional norm-referenced scales and discrete-^bint, multiple-choice formats to 
measure a variety of related verbal skills. As a member of a state language 
proudency instrument review committee, however, I have noted that many 
new tests carry the names of linguists as authors, that language-eliciting proce- 
dures more characteristic of linguistic .research methods are being adapted to 
widespread language testing, and that greater attention is being given to criter- 
ion-referenced levels of competency using integrative or semi-integrative for- 
mats. Fortunately, too, there are tests which seem to incorporate features 
from both "camps," as it were (e.g., multiple-choice cloze tests). 

Yet it is this very demand which has contributed to certain abuses. One 
might say that a bandwagon effect obtains, that certain individuals and test 
publishers are rushing the completion and dissemination of language tests in 
order to capture a profitable percentage of the market, even though many of 
these tests lack the qualities necessary to recommend them (See, for example, 
Silverman et a/., 1970). 

For eicample, at least two 'tests" do not require children to respond or 
even to be observed. They are essentially demographic questionnaires which 
can be filled out using school records or through rn interview with an adult 
family member. Whats worse, no effort has been made to validate these instru- 
ments against behavioral criteria, 'viz, how a child actually uses language. One 
of these instruments appears to have grown out of a family linguistic interaction 
study. 

Other tests either lack a theoretical rationale or are based on as-yet un- 
verified hypotheses about language acquisition or performance. One technique 
assumes that the ability to name tije antonyms of individual lexical items in 
two languages is a good measure of bilingual proficiency, another uses the 
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ability to repeat phrases without errors in pronunciation as adequate to measure 
language dominance. And the list goes on. There are tests which hardly require 
a person to speak at all but only to indicate comprehension, yet which are 
interpreted to indicate oral performance. 

A third category of tests measure only marginally useful constructs for 
educational purposes or confuse the constructs they purport to measure. Tests 
which measure language dominance without yielding an estimate of compara- 
tive language proficiency are actually misleading to educational decision makers. 
Many practitioners do not realize that one can be dominant in a marked lan- 
guage while still competent in the unmarked tongue. In the same vein, proce- 
dures used to determine a person's "preferred" language are not necessarily 
measures of language proficiency. 

Another popular practice is to develop a test for local consumption and 
then to market it nationally, despite its obvious dialectal biases. This cleverly 
shifts the burden of validity entirely to the out~of*region purchaser, who very 
often has to "adapt" the test anyway, thus further confusing the issue by 
admitting non-standardized testing procedures. 

Finally, many of these problems with tests could be avoided if otherwise 
naive test makers would follow or devise some rigorous system of test develop- 
ment. The responsible test developers -publishers have articulated adequate 
systems of test development and generally make them available for the asking. 
(See, for example, ETS f 1979.) These guidelines can be characterized by re- 
search and development (R&D) procedures, in which the ultimate touchstone 
of a test is the empirical analysis of how it fares in the field. 

Certainly one of the principal reasons why test is a four-letter word, why 
so many tests evidence such strong biases against minorities — even to the point, 
in some instances, of not even measuring the same construct (Anastasi & 
C6rdova f 1953) — is precisely this lack of development. The target population(s) 
are inadequately specified, insufficient trials are conducted, revisions are not 
made on the basis of empirical analyses of answers, different formats (e.g., 
different directions, pictures, answer sheets) are not tried out, validity is left 
entirely to someone's "expert" judgment about what constitutes a good language 
sample, and levels of competence are set without reference to the abilities of 
native speakers. It happens, for example, that some linguist's research instrument 
appears quite suddenly in printed form for sonic more general educational use. 
In short, too many language tests are not developed hut merely p.oduced. 

2. Test Selection 

Yet another reason why test is a four letter word has to do with the way 
tests are selected (or kept in use) Too often test selection is a decision based 
upon a single factor, such as cost, administrabUity, scorability, attractiveness, 
popularity (i.e . "Everyone uses this test"), or political favoritism (i.e., "George, 
the salesman, is a good ol* boy" or "June, the author, is a linguist, so this test 
must be better than the one devised by all those nasty psychornetricians") 
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Whats worse, teaching and supervisory personnel are rarely consulted or 
trained in these matters, thereby heightening the likelihood of hostility toward 
or mistrust of a particular test. Needless to say, facite test selection frequendy 
results in the use of inappropriate instruments for the populations and school 
programs of inteie**. 

Some test manuals also report inappropriate statistics or incomplete sta- 
tistics, and one is left to wonder about the adequacy of the test. Inter-rater 
reliabilities, for example, are important to report when a test requires an ex- 
aminer to transcribe what an examinee says or to make judgments about the 
r accuracy of his/her utterances. Estimates of the internal consistency of a test, 
while appropriate for language proficiency tests or techniques which are un- 
speeded or at least not highly speeded, must nevertheless use a special tech- 
nique whenever the individual "items" are not behaviorally independent of each 
other (American Psychological Association, 1974) as in the case of cloze tests. 
Also, it appears to me that the test-retest method of estimating reliability, which 
is not particularly useful to most traditional psychometric applications, merits 
further attention in language proficiency measures where an examinee must 
formulate a verbal response. 

3. Test Use: Misuses and Abuses 

Once a test is adopted many institutions find it "inconvenient** to remove 
it from use. In some cases, institutions and researchers continue to use obsolete 
tests which even the test publishers no longer market. The point to remember 
here is that unsecure tests, like dairy products ~.t the super market, have a 
certain "shelf life": Their validities "spoil** as their content becomes known and 
outmoded. Tests must be either revised periodically or relegated to the shredder. 
The MLA Foreign language Proficiency Tert for Teachers and Advanced Stu- 
dents, as an illustration, is no longer published by ETS, yet one can still find 
yellowed copies of it be«ng useu to screen bilingual teacher candidates) 

Other sources of misuse stem from t!.e .nindless, routine testing which 
actually benefits no one. Incredible amcu its c: money, for example, are spent 
in perfunctory evaluations an<| assessments. One of the "necessary evils" which 
teachers and administrators have unfortunately learned to live with is the 
testing associated with evaluations intended merely to keep some governmental 
agency pacified (i.e., I>y meeting us requirement to conduct and evaluation). 
The resultant mountain of i^st d: ta is never utilized to help individual students 
or to review the program's ft met onirig in detail. Test results are very often not 
reported at ail to instructional and supervisory personnel, or are reported in 
an untimely manner. I hasten to point out that even tests devised by linguists 
are subject to these kinds of misuses. 

Tests are abused when they aie applied to persons for whom they are not 
appropriate, when they are used without verification in situations for which 
they wert not intended, or when their results are overgcneralized. Certain 
language proficiency testing formats may not be generally efficient for all Amer- 
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ican or foreign populations; a test of English language proficiency may not 
be a valid tool for relegating minority students to non-academic curricular 
tracks; nor can a person who doesn't speak much English be considered "cul- 
turally deprived." The roots of these abuses can generally be traced back to 
researchers' and practitioners misunderstandings of what tests are and their 
misinterpretation of the measurements which tests yield. (See American Per- 
sonnel and Guidance Association, 1978. ) - 

4. Concluding Remarks 

i 

I have departed somewhat from my originally prepared remarks not to 
condescend to present elementary points about testing but to share some of 
the fundamental and hard won lessons of my profession with you. During this 
conference I have heard a number of scholars in the sevejjj sulxlisciplines and 
specializations of linguistics and language teaching make statements about tests 
and testing which are basically in error. (A test which reports a reliability 
coefficient of .90 does not mean that the test score is "wrong" ten out of every 
100 times!) What's perhaps more telling is that their statements -seemed to go 
unchallenged except by those of us in attendance whose training or experience 
has steeped us in test development. 

The other night a few thousand of us heard a fascinating* set of character- 
izations (or is it caricatures?): Psychometrioians are supposed to be preoc- 
cupied with reliability, while linguists are concerned more centrally with the 
issue of validity. Such a heuristic is neither accurate nor constructive. I believe 
it is devisive, and that it should be examined. 

In the first place, there seem to be increasing numbers of language and 
testing specialists who are crossing over to learn from one another. Secondly,, 
test reliability is propadeutic to test validity, because test reliability refers to 
the degree of confidence that one can place in an obtained score or classifica- 
tion. Tests which evidence low reliability leave one in serious doubt as to 
where to place an examinee on a soak of language comprehension, fdr example. 

Finally, validity is not all of a piece/ as it were. There are many kinds of 
test validity, and a test which doesn't meet a linguist's, criteria for content 
validity may .nevertheless have excellent predictive validity and utility for cer- 
tain consequential decisions, such as educational placement in a particular 
language program. 

And here, perhaps, wi 1 can join the issue .and learn from one another, One 
of the true characteristics of psychometrioians is their empirical bias; they 
have a real "show me" attitude towards any test or technique, and it is my 1 
devout wish that you will respect this dear if sometimes chaffing propensity, 
for it is founded on mostly experience. If linguists indeed can produce better 
language tests than non- linguist psychometricians, then it is because they have 
devised demonstrably superior methods of obtaining language data and ensur- 
ing its more accurate interpretation and application to issues and problems 
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w ^ich are worthy of measurement, the pr " sional concerns of the TESOL 
membership. 

So I now propose renewed efforts for us all to collaborate, not because 
psyghometricians are not producing valid language tests, but because together 
linguists and psychometricians might increase the validity and utility of lan- 
guage measures while satisfying the requirements of our respective disciplines. 
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cessor son sometime in the third millenium before the first -coming of Christ: 
"Be a craftsman in speech [so that] thou mayest be strong ... the tongue is 
a sword . . . and speech is more valorouf than any fighting . . " (Lafay, Stan- 
field, and Glanzman, 1978). These words were inscribed on a cuneiform tablet 
recently found among the ruins of the great empire of Ebla. Sometime later a 
Chaldean named Abraham settled in that same land, and some centuries later 
still one of his children wrote "The Gensis record: In the Beginning God 
created. . . , w In the beginning was the Word . . Much Jater still, but less 
than a century after the crucifixion, John paraphrased and expanded on the 
Genesis aocount writing and wrote again, "In the beginning was the Word, 
and the Word was with God, and the Word was God" (John 1:1, New Amer- 
ican* Standard), And in verse 14 of the same text he wrote, "And the Word 
became flesh and dwelt among us . . . rt {John 1:14, New American Standard). 

More recently, just before his deat|i, the Russian psychologist Lev Vygotsky 
(1934) avowed that Faust was rtiore correct v*ben he said, "In the beginning 
was < the deed." Vygdtsky contended that "action was there first" and that the 
word "is the end of development, crowning the deed" (p. 153). Of course, 
neither Moses nor John were writing about child development, but rather about 
the creative intelligence responsible for the existence of the universe. In either 
case, there is a fundamental problem in trying to put actions ahead of the 
organizational power of the word. Is it possible to conceive of meaningful 
action without a plan? There is a serious question whether any order can exist 
biologically, physically, or otherwise apart from something very much like the 
prepositional forms of language. Consider for instance the much celebrated 
words and phrases of the genetic code, or the conversational ^changes between 
the cells during embryonic development. Apparently geneticists 'do not doubt 
that such conversations take place, The debate only concerns whether the con- 
versations are long and interesting or short and boring (Wolpert, 1978). Or 
consider the evidence from the research of T. G. R. Bower showing that an 
infant apparently knows even at the instant that it emerges from the matrix 
that a visible finger is tangible (Bower, 1974). Even an act as simple as reach- 
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ing and grasping, as was pointed out long ago by the famous neurologist, Karl 
Lashley, involves the sort of propositional planning (agent, instrument, action, 
object) that is characteristic of the sequential and hierarchic ordering of words, 
phrases, sentences, paragraphs and discourse in natural languages. % 

An interesting question to ponder is whether the order that is observed 
in the physical universe, the genetic code, or the knowledge of an infant at 
birth can be character! ed in any form other than the propositional structure 
of natural language or some logical equivalent of language. We might well ask 
whether it is possible for there to be order or a plan apart from the organiza- 
tional force of a kind of propositional logic. Of course, it is possible for an 
organism to behave in art orderly manner without being able to explain its 
actions in words, but even the simplest responses of a unicellular organism' 
such as an amoeba — say, in moving away from heat — have a kind of proposi- 
tional character. The organism is programmed to respond to certain states in 
specified ways. Whether the program is acquired or innate is not the issue. 
The nature of the program is the issue. Could it be construed as anything other 
than a kind of propositional calculus or logic? 

There are many avenues of approach to the fundamental questions we 
are considering here. Perhaps the most powerful method of all is the exercise 
of intelligence in theory building. Empirical methods can also be applied at 
least indirectly by means of specific hypotheses derived from the general theo- 
retical considerations. One of the most appropriate empirical methods of ad- 
dressing these matters is to examine psychological tests. If the deep logic of 
language is in fact fundamental to all sorts of actions and sy steins of order, 
it should certainly be the case that tests requiring complex problem solving or 
systematic and sequential cognition should be substantially related to tests of 
the ability to process language. Putting the matter more precisely, it should be ■ 
possible to demonstrate a substantial comnuinality across tests aimed at purely 
verbal processing, such as filling in blanks in a cloze test where every nth word 
has been deleted from a text, and tests aimed at non-verbal processing such as 
solving visual puzzles or seeing similarities or complex relations in visual pat- 
terns, and so forth. In fact, we should find a substantial overlap in variance 
(in the statistical sense) in tests of widely different sorts such as taking dicta- 
tion or doing arithmetic problems. 
\ A few years ago when some of these questions were laised seriously for 
! the first time, two types of reactions prevailed. Some of the theorists (so-called 
"expertsi) .said that language tests could not possibly predict very much of the 
variance in non-verbal tasks and others said so what if they could. Hardly any 
of the educational and psychological specialists would admit the seriousness of 
the question. Some were willing to concede the point on the presumption that 
one has to understand and produce intelligible language 111 order to get along 
in a complex modern society, hut this is known by everyone and doubted only 
by fools. So what! ! ! Other specialists followed an equally superficial logic con- 
sidering only the outward appearance of tests and reasoning that no matter 
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what any theory might suggest to the contrary, doing arithmetic is not the same 
as filling in blanks on a cloze test and that's that! ! ! A few specialists even 
claimed that the importance of language ability to tests or ail sorts was so well 
established and so well researched that no further work was needed. They 
harked back to the work of the giants in educational measurement and their 
acquiescence concerning the importance of "verbal comprehension" to all kinds 
of tests. 

All of the foregoing arguments, however, were hpsed on an exceedingly 
impoverished understanding of the nature of language proficiency, This is not 
to say that what is known today is the final word by any means, but it is cer- 
tainly to be expected that recent advances in linguistics, psychometrics, and 
language testing research have increased our knowledge of the nature of lan- 
guage ability. Interestingly, many of the surprising discoveries of recent years 
have come from the field of second language testing research. In fact, findings 
in this field, which defied the predictions of prevailing theories, set the stage 
for the present controversy over the validity of educational and psychological 
measures of all sorts. 

Classical theories of tests and measurement led the initial theorists in 
foreign and second language testing to expect certain outcomes which have 
never materialized in any of the empirica 1 studies. Naive structural linguistics 
and contrastive analysis also contributed to some of the speculations that were 
bandied about in the literature as dependable truths. For instance, the popular 
method of testing foreign language skill known as dictation was rejected by 
Lado (1957) on the grounds that it did not segment the various elements* of 
structure and aspects of skills in the manner required by what was later called 
"discrete point" theory by Carroll (1961). According to Lado, and others, it was 
essential to divide language up into its component parts and test these com- 
ponents one by one. For instance, vocabulary, syntax, and phonology all had 
to be tested separately as did productive and receptive repertoires, as well as 
literacy and what has come to be called "oracy"'. In fact, each element of many 
different inventories subdivided and subclassified in various ways each had to 
be tested separately. For example, at least in theory, it was necessary to assess 
receptive vocabulary in literacy and productive vocabulary in literacy; recep- 
tive vocabulary in oracy and productive vocabulary in oracy; etc., etc., etc. 

This kind of thinking did not originate in the field of second language 
testing. In fact, it went back to the heyday of intelligence testing and the 
writings of people like L. L. Thurstone and T. G. Thurstone (1941) who in- 
sisted that complex mental abilities could be decomposed into multiple primary 
abilities. They argued for the description of mental ability in terms of a "profile" 
which would display an individuals disparate potentials on a variety of primary 
abilities measured by separate subtests in a battery of intelligence tests. They 
wrote, 

there is nothing wrong about using a mental age or an intelligence quotient if it is 
understood as an average of several tests. The error that is frequently made is that 
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the intelligence quotient is sanctified by the assumption that it measures some basic 
functional unity, when it is lyiown to l>e nothing more thm a composite of many 
functional unities (p. 8). 

L. L. Thurstone was particularly concerned to dispute the argument of Spear- 
man (1904, 1927, and 195Q) who had produced strong empirical and statistical 
evidence for a general intellectual ability that seemed to be rather unitary. 

The point here is not to defend Spearman's theory of general intelligence, 
but to point out the connection between the kind of analytical thinking that 
came to pervade the whole field of psychometrics and the discrete point ap- 
proaches to language testing. We m£1I return to the general intelligence factor 
below and propose that it is identical with a general factor of language pro- 
ficiency. In the meantime, it remains to consider some of the empirical findings 
of second language testing research that contradict the claims of discrete point 
theories. The treatment here is necessarily sketchy :md incomplete as other 
works have dealt with this matter in much greater detail (Oiler, 1976b, 1979b). 

Until about ten years ago, little empirical research on language testing had 
been done. Rebecca Valette had shown that dictation was about as good a' 
measure of overall proficiency in French as a more elaborate objective examina- 
tion as early as 1964, but the application of correlation and more sophisticated 
statistical techniques was almost unknown to the field of language teaching and 
testing until the late 1960's and early 1970 s. In 1969 it was- discovered that the 
dictation portion of the UCLA English as a Second Language Placement Exam- 
t ination (Form 1) correlated more strongly with each of the other subscores 
than any of them did with each other. This finding was rather surprising as it 
contradicted the £xpectatiorao^the popular theorists. It showed that something 
about the dictation procedure was capable of generating more meaningful 
variance across individuals than any of the other tests (01]|£, 1970). The puzzl- 
ing question was why should dictation correlate so strongly with tests that 
didn't even require auditory processing of speech signals? 

The unexcepted strength of the correlations observed between dictation 
and other testing methods was subsequently borne out in repeated studies 
(Johansson, 1973, Oiler and Streiff, 1975, Whitaker, 1976, and others). More- 
over, several other testing techniques were later discovered to have similar 
robust properties. For instance, the cloze procedure which had been widely 
studied with reference to reading ability in first language research proved to 
be an exceedingly robust tool for testing language profficiency in general (Oiler 
and Conrad, 1971 Oiler, 1972, Stubbs and Tucker^l974, Hinofotis, 1976, and 
many others). Swain, Dumas, and Naiman (1974) obtained similar results with 
elicited imitation (simple repetition j^l a testing tool. In 1976-1977 at Southern 
Illinois University's Center for English as a Second Language, a number of 
othei testing techniques were added to the list (see Scholz, Hendricks, et al, 
1979). Controlled essay and oral interview proved to be about as effective as 
dictation, cloze, and elicited imitation,, and other techniques are still being 
added. For instance, Robert Scott, Susan Rubin, Mary Galvan, and Gustavo 
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Gonzalez at Resource Development Institute in Austin have produced evidence 
showing that a command test in the tradition of J. J. Asher's approach to lan- 
guage instruction is also an extremely effective testing procedure. 

Research at RDI in Austin shows that a wide variety of testing techniques 
such as those mentioned in the preceding paragraph generate reliability co- 
efficients in the high nineties and validity coefficients in the high eighties and 
low. nineties. How is it that such global testing procedures can be so robust 
and effective? Furthermore, how is it that such radically diverse techniques as 
oral interview, written cloze, dictation, essay, elicited imitation, imperatives, 
and the like correlate so strongly with each other? Could they all be tapping 
the same ability? 

The popular discrete point thinking of the 1950's, and 1960's could not 
encountenance the results of the research. From the vantage point of the dis- 
crete point theories, there simply is no explanation for the strong correlations 
that have been found consistently across radically different methods of testing. 
Moreover, discrete point theorizing resulted m muny predictions that turned 
out to be incorrect For instance, if a vocabulary test in a written mode is really 
measuring something other than what is measured by a grammar test, how is it 
that both may correlate more strongly with a dictation or a cloze test than 
they do with alternate vocabulary and grammar tests'^ Discrete point theory 
can offer no explanation for the fact that tests aimed at different components 
are often apt to correlate as strongly or more so than tests aimed at the same 
components of language^) roficiency. More problematic still is the fact that a 
.test that is not even aimdd at any single component (such as,a cloze test or a 
dictation) may measure each component more validly than do the tests that 
are designed to di, so. For instance, a phonology test or a pronunciation score 
may correlate more strongly with a dictation, say, than with another phonology 
or pronunciation score. (See research cited above and references there, also see 
Oiler, 1979b). 

As the results came in from year to year, it soon became clear that there 
was something radically amiss about the analytical views that discrete point 
language testers had adopted fiom intelligence testing literature. Why should 
tests that appear very different at the surface produce results that are quite 
similar while tests that appear very similar are apt to yield less similar results? 
How is it that the analyses of language skill into so many categories was lead- 
ing to unexpected outcome? An answer to these puzzles was not long in coming. 

In fact, long before the results' of the language testing research began 
to disturb the discrete point theorists, there were sound theoretical reasons to 
expect those results and to reject the thinking of the discrete point approaches. 
Who would propose that you can assess the efficiency of a machine by taking 
it apart and testing the parts without reference to the function for which the 
machine was intended? Surely if such an approach is not applicable to pure 
mechanisms it is even less applicable to something as complex as a human 
language user. Imagine trying to test the efficiency of a racing car by rolling 
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all four wheels down the track at top speed without the rest of the ear! Is this 
any more absurd than testing phonemic contrasts and claiming to test language 
ability? Or testing surface morphemes and claiming to measure language ability? 
It would not even make sense to add the scores on a great diversity of discrete 
point tests and call the total a measure of language proficiency. Adding a bunch 
of invalid scores does not produce a valid score. If discrete point tests happen 
to produce some meaningful fiance and the composite of several of them 
turns out to be fairly reliable, it will have to he purely accidental because the 
theory on which the various subscores are based is quite simply incorrect. 

Why should tests that appear very different at the surface produce very 
similar results? If we consider what is to be learned from a pragmatic theory 
of language use it is obvious that surface form is a subordinate and in many 
case inconsequential aspect of language in use. For instance, suppose you 
receive news that the bank will make the loan you are applying for to pur- 
chase a house. Does it matter whether the message is delivered by phone, or 
mail, or face to face? Is the number of plural morphemes used in the message 
in anyway significant? Suppose the bank officer, whom you know, merely calls 
and says, "You're go on the house." wouldn't this be just as effective as a 
lengthy letter? Does it matter if a passive or active sentence structure is used? 
The obvious fact is that the surface form of a message is largely unimportant. 
It is the meaning or pragmatic import of the message that matters. 

Consider the possibility that the caller might repeat the message in exactly 
the same words ten tunes— or perhaps even lepeat the call ten times. Would 
each of these repeated surface forms have the same consequence? Absurd! 
The point is that the surface form is consequential only insofar as it affects 
the message or helps or hinders the communication. Even identical surface 
forms do not ha\e identical consequences from a pragmatic point of view, and 
totally distinct message forms m completely different modalities may in fact 
have (for all practical purposes) identical pragmatic consequences. 

Therefore, we should not be surprised by the finding that there is a power- 
ful relationship across language processing in different modalities. In a very 
important sense, the problem of communication is the same whether the mes- 
sage is written or spoken and regardless whether the task is to send it or inter- 
pret it. E\eu when the purpose of language use is not so much to convey 
information as it is to enjoy phatic communion or private re\elry, the focus 
of attention is still away from phonemes and morphemes and gramatical struc- 
tures per se. As the earliest writeis on pragmatics insisted many years ago, the 
totality of a language experience is greater than the sum of the distinguishable 
parts. It is not in flfct the parts per se that have importance but rather their 
arrangement in relation to each other and to experience— their temporal se- 
quence and their inferential connections to states of affairs, The essence of 
meaningful language use tianscends the surface form of the words in much 
the way that the human body is more significant than the clothing we happen 
to put around it or in the way that the personhood of the human being 
O 
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transcends the physical stuff of which the body is made. Wouldn't it be jitst 
a little pretentious for an anatomist to claim he could fully describe the nature 
of the human organism in terms of measurements of the various parts of the 
bodies he had studied in his laboratory? Is it any less pretentious for linguists 
and educators to claim they can measure language proficiency in terms of a 
superficial analysis of utterance forms and sentence patterns? 

Apparently, the reason that prap^atic testing procedures work as well as 
they do is because they bear some rest nblance to the sorts of applications that 
people make of language in normal experience. In fact, such tests meet two 
naturalness criteria: first, they require the processing of verbal material under 
normal temporal constraints; and second, they require the pragmatic mapping 
of that verbal material onto experience. Put more simply, they require the time 
constrained processing of meaning. 

There are reasons, therefore, to expect certain testing procedures to work 
considerably better than others. In particular, tfrere is no reason to suppose 
that a test aimed a phoneme discrimination will be a particularly good measure 
of anything of consequence—certainly not the ability to use the phonemes in 
question in normal communication. On the other hand, there is every reason 
to expect testing techniques such as cloze procedure, dictation, elicited imita- 
tion, essay, oral interview, and other methods which meet the pragmatic natural- 
ness criteria to prove to be highly useful measurement approaches. Further, we 
should expect the latter techniques to be strongly correlated with each other. 
Just how strongly is perhaps the most important outstanding research question 
in current studies. 

In 1976 it was suggested that perhaps the general factor of intelligence 
proposed by Spearman (1904) to explain the overlap in variance on many 
complex problem solving tasks was really a language proficiency factor. At that 
time considerable evidence from second language testing research was offered 
showing that a single language proficiency factor could account for the vast 
majority of the variance in a number of diverse testing procedures (Oiler, 
1976a). Since the testing procedures for measuring language proficiency were 
considerably better researched at that time than they had been in the early 
1900s when Spearman proposed his theory of general intelligence, it was sup- 
posed that perhaps what Spearman thought to be a general cognitive capacity 
was more accurately characterizable as a global factor of language proficiency 
—an acquired kind of knowledge or skill. This is not to minimize the importance 
of the factor nor to propose that it is not an essential ingredient in learning, but 
it is to say that whatever the tests measure may be. much more closely related 
to background and experience than it is to genetic inheritance. If this huqph 
were correct, it would remove any vestige of empirical support for the Jensen- 
Herrnstein hypothesis about the hcritability of 1Q. No one can reasonably 
claim that proficiency in a particular language is inherited. The capacity to 
acquire language surely is inherited, but this is not what we pre measuring 
when we assess acquired language proficiency— at least we are jnot measuring 
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the inherited capacity directly, only its results. Moreover, *.*>e know that lan- 
guage proficiency is highly malleable and is largely de ermined by experience. 

The first empirical study with monolingual native speakers was carried 
out in 1976-1977 in Saint Louis by Thomas Stump. He asked how much of 
the variance in the Lorge-Thorndike verbal and non-verbal IQ scores, and in 
the several subtests on the Iowa Tests of Basic Skills would be attributable 
to a language factor measured equally well by cloze and dictation. He tested 
about 100 children at grades 4 and 7 who were native speakers of English. 
His results showed that 54$ of the total variance in all of the tests at the fourth 
grade level was attributable to a language factor and 62$ at the seventh grade 
level. Interestingly, 72$ of the variance in the Lorge-Thorndike non-verbal score 
was attributable to the language factor for seventh graders. The strength of 
correlation across tests was particularly remarkable due to the fact that the 
clpze and dictation testing was done a little more than six months after the IQ 
and achievement testing. 

The second study was carried out by Virginia Streiff in response to Stump's 
lead. In 1977 she analyzed data from Hopi-English bilingual children employ- 
ing the subscores of the California Achievement Tests and both an oral and 
a written tjo/e test. Her result? with 47 children -from grades one through six, 
showed that a single factor accounted *f or no less than- 79$ of the total variance 
in all of the tests. The CAT arithmetic computation score (not the word prob- 
lem score) correlated at .92 with the single factor extracted. That is 85* of 
the variance in that subscore was attributable to a language factor. As in the 
case of Stump (1978), StreifFs language tests were administered some months 
after the CAT. However, in 1978-1979 she was able to replicate her study with 
nearly simultaneous testings and found that a single language factor accounted 
4 for 85$ of the total variance in the tests. The variable that correlated strongest 
with that factor was the writ f en cloze test (no oral test was included in^the 
replication). For the 60 children tested, ranging this time from second through 
sixth grade, 90$ of the variance in the general factor was attributable to the 
cloze score and no less than 77$ to any other score. The arithmetic subtests 
aimed at pure computation and word problems respectively correlated at .90 
and .94 with the language factor. That is, 81$ and 88$ of their respective vari- 
ances were attributable to the language factor. ( See Streiff, 1978, and 1979. ) 

Initially, the possibility of a global factor of language proficiency was 
proposed with respect to second language data. The findings of Stump (1978) 
extended the argument to first language data and those^of Streiff (1978) to 
data from bilitfguak m about the same time that the Stump project was being 
carried out. Flahive ( 1977) was conducting one of the hist studies attempting 
to separate non-verbal intelligence from language proficiency in non-native 
speakers of English. In spite of the fact that he continued to argue in favor 
oi a distinction between non-verbal IQ and reading ability, his data showed 
that in fact the distinction k highly suspect. The non-verbal test he' used was 
one of the measures referred to by Jensen (1969) as a nearly "pure" measure 
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( ,f .^or general intelligence—namely, R'veiA Progressive Matrices. Flahive 
also used the McGraw-Hill Basic Skill System reading Test, the Test of English 
as a Foreign Language, the Perkins-Yorio paraphrase recognition test, and a 
cloze test. For 20 advanced non-native speakers of English, correlations across 
tests ranged from .59 to .84, Interestingly, the strongest correlations were be- 
tween the Raven test and the McGraw Hill (.84) and the TOEFL with the 
Trkins- Yorio test (.84). Why should tests aimed at the same construct correlate 
only about as strongly as tests aimed at totally diiTeient constructs—e.g., read- 
ing proficiency in a second language, and non-verbal IQ? It is especially sur- 
prising that non-verbal IQ might be quite indistinct from proficiency in a second 
language. Nevertheless, it is known that proficiency in the first language is 
s'-ongly correlated with proficiency in a second (Beranson "and Hawkes, 1976, 
Swain t Lapkin, and Barik, 1976, Johansson, 1973, and others). 

Does Flahi ,0 's study provide evidence for a single general factor of pro- 
ficiency including 'he non-verbal IQ test? Empirical study shows the answer 
to be a resounding yes. Table 1 presents the results of a principal components 
analysis (with unities on the diagonal) of the original correlatun matrix from 
Flahives data. The loadings on the first principal component account for so 
much of the total variance that ;i is highly doubtf'il that any reliable variance 
ould ^e left in a^y of the tests n this general component were extracted. The 
^portion ( ,f total variance accounted for by the single component is 74.92. It 

TABLE 1 

A Principal Components Anal>su of the Correlations Reported by 
"^Flahive (1977) between Various Tests Administered to a 
Croup of Advanced Second Language Learners 
N=^20 



Loadings on Proportion of 

(or Correlations Variance in 

with) a General Each Test 

Tests Factor, g Kxplained by g 



Raven's Progressive Matrices 864 .747 

Test of Knghsh as a 

Foreign Language 877 .769 

McGraw Hill Reading Test 866 7^1 

Perkins-Yorio Test 883 

Cloze Test _ . 836 _ 699 

is important to remember that this analysis is completely unique and determi- 
nate (Nunnally, 1967. 317). If we were testing the hypothesis that a single 
^neral factor accounts for the vast majority of the variance in all of the tests, 
we would ' ^ve to accept that hypothesis. ^ 

While Flahne employed non-native speakers of English, )"s f indings in 
concert with previous findings would lead us to suppose thnft what hds formerly 
been general intelligence might actually be a language proficiency factor. At 
riie Boston TESOL meeting in 1979, Chesarek and Oiler presented data from 
a bilingn 1 population of English-Crow speaking children from Montana. Tests 
included not only the Raven Progressive Matrices, but also CattelT.s Culture 
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Fair Intelligence Tests, Scale 2, Forms A and B. These tests are all aimed at 
non-verbal intelligence. Also included were the Peabody Individual Achieve- 
ment Tests (Mathematics, Reading Recognition, Reading Comprehension, 
Spelling, and General Information), the Peabody Picture Vocabulary Test, sev- 
eral subscores f>uir ti^e Illinois Test of Psycholinguists Abilities (Auditory 
Reception, Auditory- Vocal Association, Visual-Motor Assoc'ntion, Verbal Ex- 
pression, and Cramniatic Closure), and a syntax test developed by Ursula 
Bellugi. Results from first, second, and third grade children revealed large and 
stable loadings on the first p*' icipal component extracted by the principal ^es 
method. Moreover, on the average no reliable variance was left in any of the 
tests if the general factor was taken out. The mean specificity (i.e., the propor- 
tion of reliable variance unique to each test) was — l.lf, while the average 
proportion of variance in each test that was attributable to a global language 
factor was no less than 542 of the total. Although the numbers of subjects 
studied in each sample were not large — they ranged from 22 to 78 in the three 
samples — the results do not support the common view that non-verbal skills 
can clearly be distinguished from language ability. Specificity estimates for 
the non-verbal IQ tests were highly unstable ranging from a low of —,18 to a 
high of ,14. 

In sum, data from the bilingual population studied simply does not support 
the common claim that multiple primary\factors (see Thurstone and Thurstcne, 
1941) can be distinguished. In particular, the argument for a clear separation 
of verbal and non-verbal abilities seems to be strongly refuted. But what about 
the case with native speakers? Is/.'t it possible that a bilingual population flight 
produce results radically different from those in a monolingual population? To 
answer these questions in part, data from the Illinois Test of Psycholinguistic 
Abilities were examined. The Manual to the experimental edition reported 
result f a principal components analysis of the nine subtests in the original 
battery Based on a sample of 700 children ranging from 2Vz to 9 years of age. 
AH of then! were presumably native speakers of English. A least four of the 
subtests could be construed as requiring "non-verbal" processing of various 
lorts — Visual Decoding, Motor Encoding, Visual-Motor Sequencing, and Visual- 
Motor Association These tasks required classifying pictured objects (using a 
*impje matching procedure), howing how certain objects are used (e.g., a gun, 
or a violin), placing i rt{ects in a previously displayed irrangement, and in- 
dicating which objects go togethei, respectively Specificity estimates for these 
tests ranged from a low of —.24 to a high of .09. Again, the specificities were 
highly variable and unstable whereas the loadings on the first principal com- 
ponent were large and stable accounting for 80? of the total variance in all 
nine subscores. The mean specificity estimate, however, was a —.1?. 

The data so clearly support the hypothesis of a single global factor of 
Lnguage proficiency that there cannot remain much question about the exist- 
ence of such a factor The question that does remain is whether there is any- 
thing else left if this factor extracted from the reliable \«trianc< in the tests. 
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It also remains to extend the findings from bilingual and monolingual children 
to bilingual and monolingual adults. 

At the 1979 Boston TESOL meeting, Paul Streiff presented findings from 
Navajo youngsters up to the ninth grade. He investigated the various subscores 
on the Comprehensive Tests of Basic Skills along with written cloze tests. 
Approximately 50% of the variance in all of the tests was attributable to a 
single language proficiency factor across each of the three samples examined. 
Interestingly, there was a tendency fin the amount of variance attributable to a 
general language factor to increase as children progressed through the grades. 
This result was consistent with the findings of Stump ('978) For a more 
complete discussion, see P. Streiff ( forthcoming )« 

While there is still a gap in our knowledge at the secondary and posN 
secondary level for bilingual populations, data on high school age young adults 
who are monolingual speakers of "English is available for the Adult Performance 
Level Survey (Oiler, 1979a ), The APL is among the popular new breed of tests 
aimed at so-called "real hie competency'. In fact, this instrument is the proto- 
type of competency te c ^ in much the same way that the" Stanford-Binet was 
the prototype for IQ s. It was pretested on 2,853 subjects from 36 states. 
The subtests in the APL battery are aimed at two categories of competency: 
content areas (including community lesources, occupational knowledge, con- 
sumer economics, health, and government and law) and skills (identification 
of facts and terms, reading, writing, computation, and problem solving). The 
correlation matrix foi the original test population (reported in the Manual) was 
examined by several statisticaLmethods, All of them revealed a large variance 
overlap across subtests, A principal factoring method (distinct from principal 
components analysis in regard to the handling of diagonal elements) with 
estimates of reliable variance on the diagoi.al of the correlation* matrix showed, 
100* of the reliable variance (within rounding error) to be attributable to a 
single general factor. 
N How can we4)e sure that the general factor identified in all of the preced- 
ing studies (and others not cited) is in fact a global factor of language pro- 
ficiency? Could it not be an intelligence factor along the lines of classical 
theory? There are actually several sorts of evidence and argument that can be 
brought to bear on these questions. Among them are simple examination of the 
correlations v i tb* 1 general factor, content analysis of the tests, and theoretical 
reasoning. All of these sourus of argument suggest that the 'general factor we 
have been talking about in all of the forgoing studies must be a language 
proficiency factor*. For instance, the tests that conelate mo>i strongly with the 
general factor (or first principal component) in the statistical analyses are 
consistently language processing tasks, Foi example, the test producing the 
strongest correlation with the first principal component extracted from the APL 
inventory broken down by skiiK was the reading subscoie Similaily. the cests 
pioducmg the stiongest correlations with the first principal component on the 
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ITPA were the Auditory- Vocal Association Test (.96) and the Grammatic 
Closure Test (.92) — each of which involves filling in blanks in sentences. 

Another source of reasoning that arrives at a similar result is to consider 
tJie content of the so-called "non-verbal" tasks. It turns out that the complex 
„ reasoning necessary to soke the Raven Progre* ive Matrices or Cattell's prob- 
lems, or arithmetic problems, visual analogies, etc., almost certainly involves 
the deep structure of propositional forms that art 1 primarily linguistic in nature. 
On the other hand, if we examine the content of the language tasks with which 
the "non-verbal" type of processing might he associated, we come against a 
blank wall. How, for example, could we reason that filling in blanks in a passage 
of prose about, say, a cat named Traveller* involves arithmetic computation? 
Or the solving of non-verbal analogies or abstract matrix puzzles? The so-called 
"non-verbal" tasks most certainly involve deep level language ability, but the 
normal processing of discourse does not seem to necessarily involve the solution 
of abstract visual puzzles or mathematical problems. Therefore, the content 
analysis of the tests supports the interpretation of the general factor as a lan- 
guage proficiency factor. 

Yet another source of evidence on the question is theoretical reasoning 
regarding the nature of language proficiency. We should expect to be able 4o 
measure the global factor of language proficiency with a variety of language 
processing tasks. In fact, the correlations across tasks that are quite different 
at the surface should, under specifiable circumstances, roughly approximate 
the estimated reliabilities of the respective tasks. That is, if a written cloze 
test and an oral interview procedure are both tapping the same global factor 
of language proficiency, the respective correlations of those tests with the global 
factor in question should roughly equal the reliabilities of the several tests. This 
result is a logical necessity if the tests are really measuring the same underlying 
ability. Moreover, such a finding would support the view that the underlying 
ability is in fact rather unitary To show that the ability in question is com- 
plex in the sense argued earlier by Thurstone and Thurstone (1941) and by 
many other theorists, would require the demonstration that stable specificities 
exist which can be associated with multiple unique factors. * 

The problem of demonstrating specificities (that is, reliable Variance unique 
to particular tests) is compounded by thr fact that in many of the original 
studies on the structure of intellect, the tests employed had remarkably low re- 
liabilities to start with. While correlations across language processing tasks of 
many sorts can be expected to mount into the .7 to .9 range, correlations across 
many of the multifarious tasks used in previous investigations of "intelligence" 
in its many posited forms have usually been much lower For instance, in the 
classic study by the Thur stone's, correlations across their 63 measures (3,969 
in all) scarcely ever exceed .6 and are mostly below the level of 2. The sig- 
nificance of the relative magnitudes of the correlations in question can be ap- 
preciated by considering the fact that a .2 coi relation indicates only a 45f vari- 
ance overlap while a correlation of 8. say, indicates a variance overlap of 64%. 
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Although it jean be argued under certain highly restrictive conditions that low^ 
correlations indicate that tests are validly tapping separate abilities, that is 
almost certainly not the case here. Here, the tests are quite probably unreliable ) 
to start with, 

A test can only be said to be reliable if it correlates with itself.^That is, if 
we repeat the measure, we should expect to get somewhat similar results. How- 
ever, it is highly doubtful that tests such as writing as many words as you 
can that begin with the letter T and end with E, or trying to recall from rote 
which first names go with which last names in a previously presented list, or 
writing al! the words you can think of that begin with D, or writing as many 
four letter words as you* can that begin with B, or replacing missing letters in 
isolated words,, are reliable. Although all of these tasks appeared in the set 
used by the Thurstones to investigate the "factorial structure of intelligence", 
they bear little or no resemblance to the sorts of things that people normally 
do with language. Further, they did not correlate strongly with anything — 
not even each other Surprisingly, at least 42 of the 60 tests used by the Thur* 
stones correlated "more strongly with a single general factor than they did with 
the seven separate primary factors posited by them (see pp. 88 and 89 of their 
\ *~M In fact, even the loadings of the tests supposed to be assessing the vari- 
ou. k nary factors (see p. 89) were generally quite weak (rarely above 6). 
Contrast these results with studies of language proficiency where the loadings 
on the general factor are often in .8 to .9 range. 

In conclusion, let us briefly examine some of the most recent evidence ir , 
favor of a global factor of language proficiency from studies of second language 
learners. One of thf* most widely accepted claims of discrete point theory is 
that oral language proficiency can be divided up into various components. This 
claim has been tested carefully now several times over. In every single case 
the evidence shows that multiple oral scales of, for instance, pronunciation, 
vocabulary, grammar, fluency, and comprehension, are quite perfectly unitary 
within the limits of their respective reliabilities. The first careful study ot this 
question came from a dissertation by Hinofotis (1976). She used the five scales 
from the Foreign Service Institute Oral Interview in conjunction with various 
other scores including scores on the Test of English as a Foreign Language. 
A principal components analysis of her data revealed a single language factor 
accounting fo no less than 85? of the total variance in the procedures studied 
(Oiler and Hinofotis, 1979). Loading': of the various subscales of the FSI In- 
terview on the global language factoi were roughly equivp'eut to their relia- 
bilities and there was no tendency whatsoevei for tests aimed at the con- 
structs of pronunciation, vocabulary, *»d grammar to correlate more strongl/ 
within categories than across categories. In other avoids, the various vocabulary 
scores obtained were no more stiongly correlated with each mwv than they- 
were with grammai scores, or measures aimed at other constructs. 

More recently. Yoro/u>a and Oiler (in press) have followed up on the 
Hinofotis lesearch to test for the possible existence ot hp!o effect which 
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might have l>een mistaken as a separate speaking factor in previous studies. 
Their results showed a substantial tendency for judges to assign similar ratings 
Jo diverse scales if they were scored on the same occasion. However, ihe data 
aid not reveal any consistent specificity associated with any of the tradition- 
ally distinguished "components" of speaking skill. It would seeri that ratings 
of fluency" or of "vocabulary", etc., are in fact ratings of the same global 
factor of proficiency — perhaps it should be called "communicative effectiveness". 
When Hie halo effect was present the first principal component accounted for 
89$ of the total variance across four scales studied (grammar, vocabulary, 
fluency, an*} pronunciation), and when the halo effect was eliminated by having 
judges assign ratings to each scale on a different hearing of each interview, 
the first principal component accounted for 792 of the total variance. The 
halo effect, thus, seems to account for about 10% of the total variance in such 
scales. There was no stable specificity in any of the scales. In fact, the average 
specificity was less than 0 indicating that the scales were somewhat better 
measures of the global factor than of the separate constructs they were sup- 
posed to be measuring. 

Additional data on this question are available from the correlations reported 
by Mullen (1979b) in a study of the reliability of oral ratings. In her research 
98 foreign students at the university !ev r vere interviewed. Five rating scales 
were used (listening/pronunciation, fluency, grammar, and overall). She argued 
that each scale was contributing something unique to the evaluation of oral 
proficiency, but * principal components analysis based on the correlations she 
reported revealed the results shown in Table 2. 

* TABLE 2 - 

A Principal Components Analysis of iUv Correlations Reported by 
Mullen (1979b) across Scales Aimed at Various Aspects of 
Speaking Proficiency of Second Language Learners 
N = 98 



Proportion of 
Loadings on Variance in 

*i General Each Scale 

Scales Factor, g Explained by g 

Listening .930 .866 

Pronunciation 905 820 ' 

Fluency 926 * 857 

Grammar 915 .836 

Oxerall Rating .980 .960 

Wheicas the "oveiall" stale was appaientb the best all aioiiud measure of 
whatever was being nicastiietl by, the \a'ious scales, all of them measure that 
s*an>e attribute to the extent of their reliability, in tact, the weakest correla- 
tion of an> scale with a single proficiency factoi was .91)5 — accounting for 
82? of the variance in the pionuneiation scale, Tlwic is no reason to believe 
that any of the scales is cotihihiitnig anv unique reliable variance (i.e.. spe- 
cificity) to the evaluation. 
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- 'Finally, a similar set of data on 117 foreign students tested in an essay 
format by Mullen (1979a) yields essentially the same conclusions. In this case 
examinees were as.ked to write an essay and their efforts were rated on five 
.scales: structure, organisation, quantity, vocabulary, and overall. Again Mullen 
concluded that each scale was contributing something unique to the evaluation, 
byt * principal components analysis based on Jhe correlations reported by her 
supports a somewhat different conclusions See Tabic. 3. As in the case of the 

TABLE £ 

A Principal Components Analysis of the Correlatioas Reported by 
Mullen (1979a) across Scales Aimed at Various Aspects of 
E say Performance of Second Language Learners 
N = 117 



Scales 



leading* on 
a General 
Factor, g 



Structure 
Organization 
Quantity 
Vocabulary 
Overall Rating 



,906 
936' 
870 
.931 
982 



Proportion of 
Variance in 
Each Scale 
Explained by g 

.821 
.877 
.757 
866 
965 



dral scales, all of the scales for essay ratings loaded on a single component at 
remarkably high levels. The first component accounted for no less than 85.7$ 
of the total variance and received loadings ranging from ,87 to .98 from the 
various scales. To the extent that the scales are generating reliable variance, 
they seer i to be measuring the same global factor of language proficiency 
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• The Role of Standardized Tests at the School Level 

Paul iosior 

Pago High School, Pago, AZ 

Standardized Tests have come under increasing criticism in recent years. 
This criticism seems to have arisen as the awareness of the educational needs 
of minority children has developed. The validity of standardized tests has been 
and is being questioned, e.g., do these tests measure what they are supposed 
to measure? IQ tests have been the focus of greatest challenge. The construct 
intelligence quotient is difficult to define at best. Determining how to measure 
this construct has created voluminous controversy. 

Standardized achievement tests are now coming under scrutiny. Again 
what is actually measured with standardized achievement tests? One aspect of 
these tests is the measurement of basic skills. Reading is a commonly measured 
skill. But critics have demonstrated that much more than reading skills are 
measured in such' tests. Reading is not an isolated function. It is an aspect of 
language, and language is the means by which we define and describe our 
environment, and culture. Consequently, a reading test reflects not only read- 
ing skills, but language manipulation skills and environmental-cultural knowl- 
edge. 

Standardized tests Avill reflect by their very nature, environment and cul- 
ture. No test is language free and no language is culture-free. Therefore, cul- 
ture and language will be reflected in any test. In the case of standardized 
nationally-normed tests, the culture will be the one reflected by the majority— 
the middle, class, and the language will be English. 

Obviously, children who do not manipulate English well ( i.e., those who 
have limited language structures and limited vocabularies and children who 
have environmental and cultural experiences different from the middle class) 
generally will not do well on standardized tests. This is the -situation in whieh 
most of minority children in this country find themselves. They speak. limited 
or non-standard English and their environment and culture are quite different 
from that of the middle class. These children will be at a disadvantage in tak- 
ing such tests until they develop proficient English skills and acquire experi- 
ences similar to the middle class. 

With these inherent weaknesses, what should be the role of standardized 
tests at the local school level jvvhere school populations include minority chiK 
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dren? There is no absolute answer, but I will attempt to provide a plausible 
response by examining the following aspects of the question: (1) the use and 
abuse of standardized tests; (2) what standardized tests measure and (3) 
what comprises a comprehensive testing prograjhi. 

1. The use and abuse of standardized tests 

The most commonly used standardized tests administered in schools are 
achievement tests. Probably every student enrolled in either public or piivate 
school has taken at least one, and very likely many, batteries of standardized 
achievement tests. Testing can begin (and often does) in kindergarten with 
a set of standardize \ readiness tests. 

Many states are now mandating standardized testing and /or are setting 
standards that require measurement by norm-referenced tests. The state of 
Arizona requires both the use of standardized tests and standards measured 
by norm-referenced tests. At the beginning of third grade all student? in 
Arizona public schools are tested in reading. Presently the Stanford Achieve- 
ment Tests are administered by local school districts state wide in October. 
Entering fifth grade students are required Jo take a standardized test in math- 
mat ics. 

Arizona requires th^t all students graduating from eighth grade must read 
at a minimum level of 6.0 grade-level-equivalent years. Seniors, in order to 
graduate, must read at a minimal grade-leyel-equivalent of 9.0. Each local 
school district must file a plan with the state for determining the grade-level- 
equivalents of 6.0 and 9.0. The determining of whether or not students have * 
achieved these levels requires the use of a locally normed or nationally normed 
measurement instrument. 

Federal programs such as ESEA Title I often require evaluation procedures 
which incorporate some type of standardized testing. T >aally this is a pre-post 
testing program in basic skills. 

What effect does the use of standardized tests and ..tate imposed standards 
have on minority children? I use Navajo children as the primary example since 
these are the children with whom I have worked for the past ten years. Navajo 
children on the average score as follows on standardized test achievement: At 
the third grade level these children score 1.0 — 1.5 grade-level-equivalent years 
below the national norm, at the sixth grade level Navajo children average 2.5 
—3.0 grade-level-equivalent years below the national norm. After twelve years 
of school Navajo children score on the average between 4.5 — 6.5 grade-level- 
equivalent years below the national norm. 

Obviously, most Navajo children will not meet the state mandated stand* 
ard* for graduation from eighth grade and twelfth grade if the national norm 
is used as the standard. Many school districts serving Navajo children have 
determined their own norm and have developed an instrument locally to meas- 
ure student achievement. This local norm often is well below the national norm. 

The goal of the Arizona mandate was to increase the reading ability of 
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students. Rather than attempting to improve the educational program to in- 
crease the reading ability of Navajo children, many districts have circumvented 
the mandate and have developed an evaluation system that allows the status 
quo to continue. f 

The use of standardized tests in Federal programs (Title I) reveals a most 
interesting situation. Programs first must demonstrate need. This is usually 
achieved by demonstrating with standardized tests that children score well 
below the national norm. Once funds are received then results must be demon- 
strated. If results are too great and children score at or near grade level, federal 
funds are terminated. The dilemma is obvious: improvement is needed, but 
not tco much. 

While conducting research, I encountered a situation that demonstrates 
abuse of standardized testing with Navajo children. The research involved two 
standardized tests used to measure reading achievement: (1) the Stanford 
Achievement Test (SAT): and (2) the Metropolitan Achievement Test (MAT). 
The total reading subtests of the two tests were quite different. The SAT sub- 
test was composed of a subsection on phonics skills and a subsection on para- 
graph comprehension. The MAT total reading subtest was a combination of a 
word knowledge (vocabulary) section and a paragraph comprehension section. 
Children in four or eight schools used in the research study were tested on both 
instruments. The tests were administered to children in grades second through 
sixth. The average scores at each grade level in all four schools were lower 
on the MAT than the SAT. The word knowledge section of MAT was devas- 
tating to Navajo children. Although Navajo children in general scored well 
^ below the national norm on the SAT, the Bureau of Indian Affairs has chosen 
to use the MAT reservation wide for students' achievement evaluation. The 
reason for th ; s choice seems apparent. This raises the question of the purpose 
of testing and the use of results. 

The greatest abuse of standardized tests is the use of the results. This is 
particularly true for individual student test scores. Test results are often used 
to predict expectations both formally and informally. Children who score 
similarly on standardized tests are often grouped together for instruction and 
such groupings can be an effective method, However, minority children all too 
often are grouped together and are labeled as the "dull" groups. Teachers re- 
view the standardized test scores and often interpret test scores as aptitude. 
Children score low, therefore they mutt be slow or have little aptitude. Con- 
sequently, teachers expect little of these childien and the self-fulfilling prophecy 
occurs: children do little. 

2. What standardized tests measure 

Standardized achievement tests are verbal instruments tSat require lead- 
ing and symbol manipulation. No matter what skills or concepts are included 
in the test, language manipulation is involved in all areas except for arithmetic 
computation. Consequently, standardized achievement tests measure not only 
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specific skills and ct cepts but how well a student manipulates standard Eng- 
lish. The Gates McGinite Reading Test, Survey E t Form 2 provides several 
examples of how the ability to manipulate English syntax is essential to suc- 
cessful performance on the test. Most of the comprehensive questions provide 
answers which are contextually cued. Native speakers immediately eliminate 
those foils that are contextually inappropriate, but limited-English speakers may 
not. To illustrate, here is an example from the test. "Mother Goose rhymes seem 
like innocuous jingles, designed only to entertain small children.? But many 
began as political satire using the nicknames of 27 or court 28 and making 
fun of their activities." The possible answers for No. 27 are (1) infant; (2) 
political; (3) stars, (4) rhymes, (5) animals. Contextually or structurally in- 
nppiopriate foils are numbers one, four and fi\c The entire question implies 
a knowledge of Mother Goose rh vines which many minority children do "not 
experience. Middle class cultural knowledge also is important if children are 
to have reasonable opportunities to demonstrate their skills on such tests. 

Hence, the use of standardized achievement tests with minority children 
often reflects more bow well they manipulate standard English and middle 
class cultural concepts, than how well these tests measure skills and specific- 
concepts learned To a large degree then achievement tests reflect how well 
minority children manipulate standard English and middle class cultural con- 
cepts in relation to middle-class students These tests bv no means reflect the 
actual abilit * or pot< ntial of these children. 

The above into/mation can be useful in setting goals and in designing 
programs. Minority children must compete in an English speaking, middle 
class wGild. In n non moral vvfrV minority children start school behind middle- 
class children, i.e. they often speak limited or non-standard English and have 
had experiences quite different from those on which school curricula aie based, 
in essence these children must learn more a»id learn faster to catch up (the 
Bereiter-Engelmann dilemma) Standardized achievement tests can be useful 
instruments in assessing the success of programs designed to catch these stu- 
dents up with native Knglisb speaking, middle-class students 

3. A compeheusive testing program 

Standardized achievement tests aie not going to disappear, in fact the use 
of these tests ina\ be on the inciea.se. Standardized achievement tests can be 
useful •vhen integrated into a comprehensive evaluation system. 

Standardized. tests alone seem to be of relatively little value They often 
do not measine what has been taught in a given classroom oi giade level. 
These tests do not serve well as diagnostic instruments because they are de- 
sign! to ineasuie the grneia! lathei than the specific They do provide some 
inhumation as to bow students m a tfiven aiea compaie to othei students 
throughout the country 

Used as one component of a < oinpichensiv e evaluation system standardized 
tests can piovide useful information. An effective evaluation system is an out- 
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growth of a well planned and designed auricular program. Objectives and 
goals are developed for specific skills and concepts to be taught in the cur- 
riculum. Evaluation results are used to assess whether goals and objectives are 
met and to determine program direction. 

A combination of a locally developed criterion — referenced testing (CRT) 
program and norm-referenced standardized tests can provide a comprehensive 
and effective evaluation system. The CRT program can assess the extent to 
which curricular goals and objectives are met. This testing program evaluates 
how effective classroom instruction is. What is supposedly taught in the class- 
room is measured. 

The overall goal of any educational program is to develop student skills 
and abilities so that students can become competent citizens/ Minority children 
have the right to such an education. The goals and objectives of ,the school 
curriculum must be designed to assist these childen in developing competencies 
equivalent to those of middle class students. The standardized achievement tests 
component of the evaluation system can be used to assess the progress minority 
students, as a group in a given school or grade, are making toward that goal. 
The data gathered from such tests may be used to review and alter curricular 
objectives. 

The Office of Education has suggested that there be a moritorium on the 
use of standardized tests, I believe this to be an undesirable move. On the 
Navajo reservation some schools have decided not to use standardized tests, 
The Bureau of Indian Affairs has substituted local norms for national norms 
for the, SAT and MAT. The results of these actions have not improved the com- 
petency levels of Navajo children. All that has been accomplished is that medi- 
ocrity has become the accepted standard. 

With all their weaknesses standardized tests can be valuable. These tests 
provide data which describe how well students, through the medium of Eng- 
lish, achieve in comparison to students throughout the country, How the in- 
formation from these tests is used determines their value. If the information is 
used m isolation and for individual students, it is of seemingly little use. If it / 
is used in a comprehensive evaluation system, the information can provide 
program direction. 

Standardized tests are not going to evaporate. The challenge educators 
face is making positive use of these tests. Certainly standardized tests can be 
better designed. School personnel should carefully examine a wide variety of 
such tests before the selection of a specific test is made. The test should be one 
that provides information needed for the norm-referenced component of the 
school's comprehensive evaluation system. We, the school personnel, to a large 
degree decide whether the data gathered from standardized tests are useful 
and valuable or whether they are abusive and detrimental. It is in the role as 
decision makers that we must use standardized tests for no more or less than 
they can assist us in educational decisions. 
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Three Kinds of Bias in Standardized Testing 



Roger W. Shuy 

Georgetown University, 
and The Center For Applied Linguistic* 



Any examination of standardized testing would do well to attend to T. S. 
Eliot's three probing questions: "Where is the learning we have, lost in informs* 
tion? Where is the understanding we have lost in knowledge? Where is the life 
we have lost in living?* The notions that learning can be measured by means 
of retained information and that understanding is reflected by knowledge have 
been challenged by wise people for centuries. At best, testing which is stand- 
ardized to large numbers of takers is guilty of assessing information and knowl- 
edge rather than the learning and understanding which Eliot so highly valued. 
Information and knowledge are the what's of education. Learning and under* 
standing are the whys. 

But even assuming that it is valuable for education to try to measure 
knowledge or information, several major problems obtrude. What is worth 
measuring? What is an appropriate sample of the larger body of knowledge 
which represents that knowledge? What gets in the way of an accurate measure 
of such information? The latter issue — that which interferes with such measure- 
ment — often goes under the label, bias. 

Bias in standardized testing exists at many levels, ^e usually think of 
bias in terms of cultural bias, but it is difficult to think of the category of test 
bias without considering the bias of the general public and the bias of the 
disciplines as well. This paper will discuss these three areas of bias as they 
relate to standardized testing today. 

Education, as every teacher knows, suffers from the attitudes of the gen- 
eral public which thinks it knows all there is to know about education and that 
special thought, study or experience with education is not needed to qualify 
one to make grandiose statements about it. Since educators appear to be under 
siege, careless interpretation of what tests acit "Jly do can be damaging to our 
profession. There exists today an industrial/business mind-set which causes 
the general public to believe that we actually know how to measure such 
things as intelligence, language proficiency, reading, etc. This mind-set, wftch 
appears to be modelled on simpler testing of how well automobiles can run 
or how widespread a television program is watched, leads the general public 
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to believe in the notion of the standard izable testing of human learning and 
to believe that we already know all we need to know in order to do it. What 
all educators know is that human children arc not like automobiles or television 
programs. Nor is learning that finite or controlable. 

This bias of the general public was made clear to me in court litigation 
concerning the Aspira Consent Decree in New York. It seemed perfectly obvious 
to the litigants, to the defendants and to the judge that what was needed in 
New York was a test which would clearly determine in which language, Spanish 
or English, the New York City Puerto Rican children could most effectively 
participate in the classroom (reading, writing, listening and speaking). This 
having been agreed on, all that remained to be done was either to select such 
a test from among those available or to make up a new one to suit their needs. 
The simplicity of this c<\se was apparent to all. The matter was referred to the 
apprc^ iate division of the New York City schools, the research department, 
which did not find the issue quite so clear. In New York, as in other places, 
pressure from court orders, legislation, community groups and the needs of 
school children have demanded immediate response, response before an ad* 
equate body of information has been developed. Furthermore, what is to be 
tested is a complex issue. ()n one level, proficiency in English mast be asies«ed 
as must proficiency in the other language. The other language differs from com- 
munity to community. On another level of complexity is the issue of what to 
test in ill the languages. Which aspects of the language^ system must be ia- 
cluded in a test of proficiency? Which can be inferred from the results of other 
aspects that are tested? What aspects of the language systems can be safely 
ignored'* Currently, tests stress phonological, morphological and lexical items. 
Some involve syntax. None capture adequately the pragmatic and semantic 
aspects of language that, on the face of it, are important elements of proficiency 
in a language Nor lias there been much recognition of the fact that we really 
do not know much about how to test these more crucial areas of measurement 
in order to satisfy the rather nice wording of the Aspira Consent Decree ("to 
effectively participate in the classroom"), ihe business in istry model does 
not permit it. 

In testing, as in most other aspects of education, the business/industry 
model is clearly at work. There is an assumption that wc already know how 
to do the task and that all we need to do is measure it. A further extension of 
the business industry influence can be seen in the immediacy of the request. 
Ironically, we often accuse 1 the Younger generation of wanting their products 
now — in fact w< call our children the now or me generation. It appears that 
they come by this naturally. The general public tends to be unwilling to defer 
gratification (a term we once saved for the underprivileged). It is, of course, 
an error to think that we already have a test to satisfy the needs specified by 
the Aspira Consent Decree and it is equally erroneous to think that we can 
build one without considerable research, thought and time. 

Then* was a time when oi.e could count on philanthropic foundations to 
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see through the superficiality of the husiness/ind- stry model of education. As 
their resources diminish, however, even these philanthropic groups are being 
forced to move toward the need for an immediate, visible, product pay-off. 
the point I am rying to make here is that however troublesome cultural bias 
may be, it is probably no more damaging than the ignorance of the issues ^ 
caused by public bias. Education is not highly regarded in this countiy, as 
evidenced by the fact that most education legislation is hinged to social, 
political or economic concerns. Seldom, if ever, is learning considered for its 
own sake. If education is so invisible to the general public, hiw much more 
invisible is language? Here we have a field in which newscasters can write 
best-selling books in a discipline whose existence they do not even recognize. 1 
Nor does the general public iiesitate to offer sharp opinions about bilingu? 1; 
language standards or clear writing, despite the fact that there is nc \ "1 
scientific agreement about what these concepts mean. Public bias has been •* ic 
recognized in education as a whole, and in language* education in particular. 
In my opinion it is a dangerous enemy of tlw abilities which must be measured 
in, standardized testing of any kind. Any public which thinks we know how to 
carry out such testing while we really don't, will continue to accept specious 
test results. 

A second broad form of bias which affects standardized testing comes 
from the disciplines which are involved. Somewhere, somehow, the uneasiness 
v hich most teachers have with the process of testing was exacerbated by the 
quantificat-..^ leading to great discipluiaiy confusions. The construction of 
any testing instrument irust consist of three components what is measured; 
how it is measured; and what the results mean. Jnst as the business industry 
model has influenced public bias, so has it surfaced in disciplinary bias. The 
delivery system is in danger of becoming the message. In an honest effort to 
improve our methodolegy of how* to measure, we urn the risk of losing sight 
of what is being measured. Put another way, our effort to find reliability can 
interfere with 0111 notion of content validity. 

A classic example of this problem can be seen m the S holastic Aptitude 
Test (SAT). As you know, reliability in.tcsti.ig is the degree of consistency 
between two measures of the same thing. This is probably what the College 
Entrance Examination Hoard docs best. The SAT compares a test- taker's ca- 
pacities with those of his peers and with students who took the test before. 
The test is also correlated with the students' grades in high school and, it is 
claimed, over the years, it has actually improved at doing the only thing it 
professes to do, to predic t success in the first \c«ir of college. Likewise, different 
forms of the test are measured against each other lor reliability. 

Reliability • without doubt, a good tin ig but by itself it does not really 
tell us very much because am test can be technically rc^ahle and still fail 

1 In a recent conversation with news commentator, KcK Newman, for exampl'% I ex- 
plained that I work in the area of linguistics and literacy "There, is no relationship between 
linguistics and literacy, " Newman responded. 
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to measure what it intends to measure. Hie attractiveness of reliability is at- 
tached to the fact that it can be quantified, thus giving the air of scientific 
respectability. Psychome'.ries has perfected the area of reliability, and for this 
wr can Ik* thankful, but the more central issue involves. the question of the 
validity of what is measured and the effectiveness of the tqst to measure what 
it says it measures Since the SAT claims not to be an achievement test but, 
rather, a predictive measure-for-sueeess-in-eollege instrument, content validity 
becomes a cloudy notion. The SAT is called an intelligence test by some since 
the difference between measurements of intelligence and measurements of 
aptitude are slight a* best. 2 

Thus the SAT measures verbal and mathematical raptors rather than all 
of the things one might wdnt to call intelligence (providtd we could ever 
agree on what these things are*). During the history of the development of 
intelligence tests, the term intelligence was thought to be unchanging and 
innate. To avoid the implications of innateness, test developers have used the 
term aptitude. Since these tests are used primarily to predict school success, 
the term scholastic was added. One could make the case that the SAT is 
really an intelligence test and that, from what little is known about intelligence 
or how to measure i' the SAT is invalid. If one takes the position that the 
SAT is an aptitude te^t, however, the test is subject to analysis of why such 
group aptitude tests .succeed at predicting school achievement. 

In theory it is possible to distinguish between aptitude and achievement. 
One is an innate property and the other is accomplishment. Once content 
validity has been determined, it is possible to measure achievement. How to 
get at aptitude, however, is not so clear. The SAT selected the mathematical 
and verbal areas as a locus tor measurement. Exactly how this decision separates 
achievement possibilities from aptitude remains very, very unclear. All children 
have* knowledge, memories and feelings about the subjects being measured. 
Furthermore, distinguishing among SAT verbal aptitude items and those items 
in verbal abilities on achievement tests is virtually impossible. The distinction 
between aptitr le and achievement; though possible in theory, is impossible 
in operational practice. What is more, ability appears to increase as achieve- 
ment is acquired. Why is it, then. "that "roup aptitude tests can predict achieve- 
ment? It is highly probable that they are really achievement tests. 

It has been necessary * to. follow this reasoning in order to address the 
issue of content validity. And it is in the area of content validity that the test 
making industry and the other academic disciplines fall awiy from each other. 
Once verbal and uuithcmahcal achievement are reccn>i..ed as integral to any 
assessment of the SAT, we must ask the question — how do we know that the 
SAT measures the right things? Willurd Wirt/ headed the blue ribbon panel 
2 The two liable (and relatively minor) differences Ix'tween intelligence and aptitude tests 

are. 

a That aptitude test* measure specif, i factors rather than general ones, 
b. That intelligence tests measure general factors rather than specific ones. 
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which spent two years studying the decline in SAT stores The panel did not 
address this question although, to its credit, it did rceogni/e it as a legitimate, 
problem: "We have accepted, foi purposes of this inquiry and report, the 
traditional value base of the SATs validity (p. II). ' 

Perhaps one of the most significant statements of the panel report grew 
out of a discussion of why scores on the Achievement Tests (which may also 
be elected by SAT takers) did not parallel SAT score declines. Scores on tests 
in English composition, French, Spanish, biology, chemistry and physics actu- 
ally increased while other subjects reveal only small declines. The panel 
lamented that it had not been able to analyze this phenomenon fully and 
concluded: 

"It is conceivahl> important that the College Hoard make ninth larger nse til outside 
committees in connection with the Achievement Tests than with the SAT the coun- 
sel sought is ui the one case from experts m the particular disciplines, in the other 
from psychometricians ami psychologist^ " p 23) 

This criticism is al the' heart of an extremely important test bias— the bias- 
between disciplines. This criticism appears to be one ol the central issues in 
accounting for the SAT score decline. The SAT is actually a disguised achieve- 
ment test which, m order to be valid, must better address what R*;.lly matters 
in the two content fields which it measures. Like most objective .ests, k is a 
discrete point measure. That is, certain items are selected from ji inventory 
of all possible knowledge in order to represent that larger knowledge in as few 
questions as possible. Only those who know the content of the fields being 
measured have the key to content validity, l est makers create items to test only 
after content validity has been established. 

This lengthy analysis of the SAT Wa> meant to servt as a ^mchstone for 
the issue of disciplinary bias. In general, many psychometricians feel the need 
to develop tests which discriminate between students. A certain number should 
get high marks and a certain number should get low maiks. There is often no. 
particular interest in why these people are so distributed Therefore, when the 
New York State reading test is given to tenth graders, it is estimated that some 
7,000 children will fail in New York City alone What other disciplines are 
asking is, why tins should be so What is there in these children which will 
cause them to tail And if we can show, on i standauli/ed test that tlicy have 
failed, why don't we take these areas of failure and highlight them -in their, 
school curriculum? Those who make up these tests are Lot often interested in 
this issue. Those of us in the disciplines ol language and linguistics, along with 
many others, care less on the whole about reliability and pupil distribution 
than we do about validity or about the question of why the pupijs distribute 
as they do and learning rather than placement. 

(Quantitative analysis in linguistics works best M the individual level 
When we go to compare one individual or gionp with another individual of 

*See, for example, Ho^er W Shm, "Quantitative Anal>sis A Case for and' Some Warn- 
ings Attains!.' in AnthropoloM ami I thuutton {htarUtlif, VII, 2 (Ma>, 1^77 )' 
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■ group we fall into treacherous waters for we face the same problem ihat every 
standardized test faces: the question of thresholds. 

Whenever the threshold of acceptability is decided, it is an arbitrary 
decision* In the case of the Aspira Consent Decree, the ultimate question was 
"What is the cut-off score at which children are sent either to bilingual or to 
-mainstream classes?" For those who have never witnessed the process of estab- 
lishing- such a threshold, let me explain how New York City did it. First, they 
^developed a test mostly of English morphology, phonology and vocabulary for 
Spanish speakers. Then they gave that test to 15,000 New York Anglos ( a term 
which, curiously enough, included Blacks)/ Then they took the score which 
separated the top 90% from the bottom IW of the Angios who took that test 
as the cut-off point to be used to separate Puerto Ricans into bilingual and 
.mainstream classes. The federal district judge who presided over this case then 
asked the New York City schools where that 905T/10? cut off point came from. 
\\ith that question, "Judge Frankel asked one of the most penetrating &iid critical 
questions in all of standardized testing. The answer, of course, was that it was 
arbitrary. The exact words* were: "It was an administrative decision 1 . 

Bias of disciplines, then, is an important consideration in the examination 
of standardized testing. I cannot speak for all disciplines in this, of course, but 
I can safely say that the bias of linguistics is to the validity of the content 
being .ncisured. This is most important of all to us. Reliability, a crucial issue 
to other disciplines, is a fine thing but only after validity is clearly' noted. It 
is unfortunate indeed that the content fields have not yet progressed to the 
point at which we can state exactly what it it we want to measurfc in terms 
so that the science of psychometrics can be best utilized. Lacking this, test 
makers tend to proceed with what they have. And in a sense they cannot be 
blamed for the weaknesses of our fiekl. But it is our obligation to call attention 
to these weaknesses and to work toward correcting them. 

The Wirtz Committee dismissed the issue of potential cultural bias in the 
SAT by observing that a definitive analysis of cultural bias is virtually impos- 
sible. It appears, however, that a definite analv ^s of cultural bias is no more 
difficult than definitive analysis of many other educational concepts. Test ques- 
tions of any type involve matching the assumptions, values and presuppositions 
of the te#t maker against those of the test taker. An understanding of the basic 
principles of language and culture m^e the identification of ootential cultural 
bias available to those who care to find it. 

T}\v major issue of cultural bias is similar to the Issue of so-called language 
dominance .tests. Such tests have all the problems of proficiency tests plus a 
problem which derives from a very uninformed notion of dominance as a static, 
content free characteristic. Much of the exciting new developments in socio- 
linguistics, psychology, philosophy and anthropology arc concerned with the 
effect of context on learning. Context can" be defined as what the learners bring 
with them to the learning process as well as the setting in which the learning 
takes place. Contra stive assumptions, values, presuppositions and setting in 
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testing all contribute to the question of why students are spread across the 
distribution curve. These factors all can be examined, thanks to recent develop- 
ments in speech act theory and communicative competence. The ways in which 
different cultures share these assumptions, values and presuppositions in differ- 
ent settings provide the key to cultural perspective. When this perspective does 
not match that of the test question, culture bias can be identified. Thus, what* 
has been assumed to be virtually impossible to identify in the past, now 
becomes a researchable question. 

This discussion of the three kinds of bias, public, disciplinary and cultural, 
is by no means definitive Nor has the field of testing really addressed them 
satisfactorily The exposition here is intended more as an agenda than as an 
answer. T S. Foot's questions may well serve as a general guide to our think- 
ing about testing for we are certain to be better off it »ve can learn to assess 
understanding and learning rather than information and knowledge. But even 
in ou- efforts at the latter we have overlooked the powerful biasses of the ex- 
pectations of the general public, of the disciplinary perspectives involved in 
producing and interpreting the tests and the mismatch of the assumptions, 
values and presuppositions of the test makers and test takers. All are biasses of 
one sort or another All can cause us to lose* sight of life in the process of 
living. 
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Methods Panel 

An Exploration of New Trends in Second Language 

Teaching: the Silent Way, Suggestopaedia, 
Counseling Learning-Community Language Learning 

WU90 M. Itivars 

Harvard University 

At the beginning of Teaching Foreign-Language Skitts I took you on a 
tour of four classrooms, all engaged in different activities, but all devoted ta 
the teaching of a language (Rivers 1968: 1-7). Now I think its time for us t<£ 
take a tour of orr Foreign Language Institute in 1879 and visit three class- 
rooms, all devote J to the learning of a language. 

In Classroom A, there is a long silence; the teacher looks encouraging; a 
student constructs a sentence; the teacher nods, changes the positions of some 
rods, and gestures to another student who, after a moment of contemplation, 
putt together an utterance and is encouraged by die teacher's reassuring smile. 

In Classroom B, a small group is seated in a circle with a tape recorder 
in the center — the recorder is there not to give but to receive language material. 
The teacher is not obviously the focus of this scene, but with some aides is 
outside the circle, waiting for students to take the initiative. One student is 
speaking, occasionally seeking a word from an aide outside the circle. Another 
student cuts ' in, rather aggressively, refusing tf> accept what the first student 
has jvst said. The other students murmur further objections, seeking help for 
stronger expressidns of, their displeasure from the teacher or from an aide. The 
latter supplies exactly what is .requested in a low tone without intervening in 
the exchange."" All of these utterances are being recorded for later 'analysis by 
the group in discussion with the teacher. 

IrT Classroom C, baroque music is playing. Soft light falls on the pastel 
walls and the students look pleasantly relaxed as they recline, breathing rhyth- 
mically, and listen to a dialogue being recited, alternately in a normal tone 
of voice, then in a persuasive whisper, and then triumphantly. The dialogue is 
quite long, so we leave before the end of the rendition. 

What can these three,, ways of approaching the language-learning task have 
in common, and how is it that they have all come to the fore in chis particular 
decade? iVe rhay note that they are no( ilfcwly forged. Cattegnos Silent Way 
ta Classroom' A was clearly described in the first edition of Teaching Foreign 
Languages in Schools: The Silent Way which appeared in 15*63. At that tirrfte, 

67 ^ 
1 O * 



68 



The Learner in Focus 



Cattegno stated that it was based on 35 years of language-k iehing experimenta- 
tion in 14 countries on 5 continents. Curran's Community Language Learning 
in Classroom B was first described in the Bulletin of the Menninger Clinic in 
1961 (and I discussed it at length in The Psychologist and the Foreign-Language 
Teacher in 1964). Lozanovs Suggestopaedia was the subject of experimentation 
in Bulgaria in the 60's although detailed accounts of it did not appear in the 
United Statesuntil 1970 (Ostrunder and Schroeder, 1970). 

The reasons why these approaches have suddenly become important to us 
can be found in particular emphases which have evolved in foreign language 
teaching in the 70's. In the grammar-translation period (prior to World War II 
in most languages and still continuing in English in many parts of the world 
today) the emphasis was on "Why teach a modern language'*. The justification, 
for reasons of respectability, was based on intellectual rigor and cultural enrich- 
ment, with the -esult that the teaching of modern languages became as mich 
like the teaching of classical languages as possible — with emphasis on ^e 
understanding of a formal grammatical system and ability to read literature 
and philosophy. 

Modern language teachers then passed through the What and How of the 
audiolingual era, asking such questions as What is language? and How do 
people learn languages? The structuralists believed they knew what language 
was. It was a system of arbitrary vocal symbols by means of which the mem- 
bers of a society interact in terms of their total culture," as Trager expressed it; 
(Trager 1949:4) and the behavtorists knew how it was learned: by acquiring 
language habits through reinforcement (or confirmation of their efficacy). This 
emphasis on what and how was continued daring the transformational-genera- 
tive period, when the so-called cognitive-code approach was much discussed 
(though rarely described). Language we now knew was an innate structure and 
our language learners through their innate language-acquisition device were 
hypothesizing about the form of the grammar to which they were attending 
and matching it against their innate knowledge of potential grammars of a 
human language. Despite the theory, in the cognitive-code approach as pro- 
posed in the literature, students had little opportunity to hypothesize about 
the form of the grammar. Since language use was considered to be "rule- 
governed behavior" (a term largely misunderstood) it behooved students to 
understand and apply the rules (Rivers 1976:11). We were back to a deductive 
presentation of the grammar, with conscious practice in the application of rules 
before authentic passages of discourse were encountered (Chastain 1976:156- 
157). 4 

but with the 70s came renewed emphasis on the individual Who are our, 
language learners? Flow do 'individuals learn? What are their personal learning 
strategies? Second-language teachers became wary of presuming they could 
teach a language and began to seek ways in which students could be given 
opportunities, situations, and time to learn as their individual proclivities 
permitted. 
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The three approaches we are to discuss this afternoon all fit into this new 
mood of emphasis on the individual and personal learning strategies. Each one 
tries to give the x student room ancHiine toJearn with as little intrusion of the 
* teacher into the learning process as possible. Gattegno (1972:xii), for instance, 
speaks of "techniques which made it possible for the teacher to say less and 
less as the lessons advanced, while the pupils were saying more and more and 
using their own inner criteria." He speaks of throwing "the learner upon him- 
self (Gattegno 1972: ix). Although all three approaches structure the learning 
situation, and structure it in very specific ways based on theoretical convictions, 
they each envision the teachers rile as indirect. This apples e^n to Suggesto- 
paedia where the teacher's role is highly structured, but the emphasis is still 
on allowing time and space for the students to use their language-learning 
capacities to absorb and assimilate the material. 

What else do these three very different approaches share? 

1. They all endeavor to involve the whole person of the stiu ,<t. Curran 
calls one of Ins books, Counseling- learning. A Whole Person-Model for Educa- 
tion (1972). Gattegno considers breathing and kinesics of tremendous im- 
portance in speaking a new language, as well as the conscious application of the 
intellect. Lo/anov tries to draw on the powers of the unconscious mind to as- 
similate positive ..uggestive factors in an act of communication or a pedagogical 
process. "Hie purpose of suggestopaedia." we an 1 told, "is to put those 'mental 
reserves' to work by organizing a cohe refit system whereby the suggestive, emo- 
tional, more or less unconscious signals we receive, are coordinated" (Racle 
1979:10). Gurran draws these notions together when he speaks of learning as 
a "unified personality encounter" (1976 41) 

2. All view the learning of a second language as quite different from the 
learning of the first. Gattegno calls it "radically different" (1972:xii). Clearly 
the Lo/anov procedures make no attempt to reproduce child native- language 
learning situations, and Gurran encourages students to draw freely on the native 
language in early attempts to communicate. 

3. All three are inductive in the initial encounter of the student with the 
language. Curran's students learn through use, Gattcgno's through inductive 
analysis of the teacher's (limited) output, and Lozanov's students assimilate 
structure the dialogue, which they memorize but without conscious effort. 
ATI three provide explanation at a later point in the learning as students require 
it. Initially, however, all three present rnateiial and let the students do what 
they can with it. To quote Gattegno. "the synthetic, the intuitive, precedes the 
analytic" ( 1972:8 ). 

4. All three approaches are i ion -collective and give the student time, thus 
reducing the «mxic f y and tension of language learning. Correc tions are supplied 
in a supportive \v v ay. as nformution rather than reproof, and as the student 
shows a need or desire tor them. Gattegno makes a strong statement in this 
area "I do not correct learners," he says. "I only throw them back onto them- 
selves to elaborate fnither their criteria and to use them more strongly" (1972. 
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31). Curran found that students wanted time. "The counselor had to be very 
sensitive . . . he noted, "to discern when he was really needed or when, or? 
the contrary, persons wanted time to find, on their own, the required word or 
phrase" (1976:33). Lozanov's "concert" allows the students this time (Jftacle 
1979:44). 

5. Each of these approaches encourages active use of the new language in 
communicative situations from the very beginning. Curran's clients try to ex- 
press their own feelings and opinions with the help of the counselor, or an aide, 
from the first moment. Gattegno's communication situations are structured by 
the teacher with rods, but the students are expected to convey information 
about these situations which they have themselves put together in the new 
language. Ix>zanov's students act out the material of the dialogue as soon as 
it has been assimilated and, by the structuring of the dialogue material, are 
"thrust directly into a communication situation (Racle 1979:46). 

6. Finally, we may note that each of these approaches tries to create a 
community feeling of "all pulling together" (Curran 1976:1,7), which decreases 
inhibitions against expressing oneself in a language of which one still knows 
very little. 

Each of these six factors can be reproduced in other approaches in a 
normal classroom: involving the whole person of the student (affective, cogni- 
tive, and physical); an inductive approach to language material, a non-correc- 
tive, encouraging attitude on the part of the teacher, active use of the language 
in communication as soon as possible, and the building of an esprit de corps 
where students begin to care for each other's progress and form a lancniage- 
learning community. These are aspects of the language learning/teaching enter- 
prise which have been urged on teachers for some years. P^rh .ps it is tjme 
now to do, not just to discuss. 
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Explorations of New Trends in Language Learning: 

The Silent Way 

Mary Hints 

La Guardia Community College 

When I realized I had fifteen minutes \o explore the Silent Way, my first 
thought was to be silent. Then, tiOne reminded me of watches— the Swiss watch 
with seventeen jewels and moving parts &r.d the digital watch. Digital watches 
from Te*as have little to do with watches from Switzerland. The two cannot 
be compared with the same criteria. One cannot criticize a digital watch for 
not having hands or a spring. One can criticize it for not keeping time right. 

And &o it is with the Silent Way. The method is a revolution and cannot 
be compared easily with familiar methodologies. U can be contrasted with 
familiar methodologies and it can be analyzed, something Earl Stevick does 
effectively in his book Memory, Meaning and Method (Stevick 1976). For de- 
tails of the method I refer you to that book and Caleb Cattegno's own WDrks 
Teaching Foreign Languages: the Silent Way and The Common Sense of 
Foreign Language Teaching (Gattegno 1983; 1976) and Fan&low's review of 
Gattegnos work in the TESOk Quarterly; Fanselow 1977). 

The Silent Way is a revolution— a dramatic change. At the stone time, it 
is a radical return to ancient, honorable theories of man, the natur^of man, 
the nature of learning, the purpose of learning. \ 

There is a mystique about the Silent Way, unfortunately, with its key 
words: rods, fidels ogdens, inner criteria, awareness. Note the range of those 
words — the concrete rod to the abstract awareness. It is the last that I want 
to explore today. 

I think there is this mystique about the Silent Way because Or. Gattegno 
doesn't identify or relate his work explicitly with the tradition a reader recog- 
nizes inherent in it. The Silent Way is new. The Silent Way is old. The method 
is new. The philosophy behind it is old. Very old. 

The sub-title of Gattegno's first book on the Silent Way is the "subordina- 
tion of teaching to learning," a pjjtase that leads us back deep into our philo- 
sophical heritage. 

Socrates considered himself a spiritual mid-wife— the teacher who could 
not teach, but who could help another give birth to his soul. Plato ir.unortalized 
tip Socrates, the inspiring questioner, ime who provoked others to know that 
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they did not know and thus to join the thoughtful search for self. (The Re- 
public: trans. F. M. Cornford 1945) Centuries later, the French philosopher, 
Montaigne, reflected on the ancients and made a distinction between learning 
and study and identified study with education, a leading out of the self (Mon- 
taigne, Selected Essays, trans. C. Cotton and W Hazlitt, 1949). Montaigne 
wrote that teaching and learning might impart knowledge, whereas study led 
to understanding; things liown were made one's own and became a part of 
ones judgment. Education .vas a continuous heightening of consciousness, an 
increasing sharpening of judgment. Such is a description of awareness. 

Examining the words of Plato, Robert McClintock (1971) notes that 
teachers "could not fruitfully instruct those who would not teach themselves, 
who would only respond passively to the most convenient appearance;" the most 
teachers could do was to convert inert souls to active study. "This theory of 
learning has sunk deep into our philosophical heritage, but it has not fared 
well in practice." It has not fared well. But, Dr. Cattcgno reminds us that 
students must assume the responsibility for their own learning. 

Dr. Gattegno is a classicist. He's also a Romanticist. Listen to his words . , . 
"It is important to think of language learning as a recovery of the innocence 
of the self." The Silent Way is Socratic. It is reminiscent of Rousseau. 

Socrates, in his questioning, was nurturing not only the process of thinking; 
he was nurturing a faculty that grows with use, a faculty whose natuial aim 
was the discovery of truth about the self The distinction, again, of Montaigne. 
Learning provided tools. Study, education, was the search for truth. All of this 1 
is edifying. And we need to be edified. , 

Dr. Gattegno reflects on truth. He also urges teachers to allow students 
to establish an inner criterion for what is right and wrong, correct or incorrect 
about the language, and, one suspects, inner criteria are closer to Plato's cav£ 
than to interlanguage. 

The philosophy I see inherent in the Silent Way places the school solidly 
in a sound, profound tradition of humanism. There are controversies, however. 
The teacher so respects the student, for instance, he expresses neither approval 
or disapproval. The responsibility for learning is the student's. So is the credit. 
But. while the subordination of teaching to learning is probably the greatest 
contribution the Silent Way has given us, this refusal to give either approval 
or disapproval is probably the most controversial aspect of it. j 

The other controversy is about th:« use of silence itself. Now, we al! would 
like our students to learn English as efficiently as possible. The Silent Way is 
one way to learn. There are others. For the degiee to which one follows the 
Silent Way in practice steins from ones reaction to silence Encounters with 
silent bring peace, anxiety, tension, communion, reflection Teachers who have 
experienced the Silent Way have shared their reaction to it For one. it was a 
painful experience at first. Her insecurity, her defciisivcness. came through An- 
other found herself having to (leal with hei own frustration, her nervousness, 
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as a result of being silent. Both finally felt unburdened and found their silence 
allowed them freedom to become aware of students, the learners. 

Other teachers, however, emerge from a Silent Way weekend saying they 
have never witnessed such a mechanical, impersonal, punishing class. For, the 
students, too, are encountering silence. The tension built up when the teacher 
is silent is frustrating for some student and, just as some teachers say this 
method is impersonal, punishing, so do some students react. When an answer 
is not forthcoming from the teacher, they become angry. In some cases, the 
experience is so punishing, students leave. Many others, however, pass through 
the experience and thrive on it. They are happy to have a teacher step out 
of the way and let them take charge. Silence e* okes tension. It evokes reflection. 

Whether silence leads to tension, which explodes in frustration, or to re- 
flection and creative exploring depends, possibly, on the teacher, but, more 
likely, on the student, his personality and his learning style. Yet it is here 
that the Silent Way is silent. I see no word on the idiosyncratic ways of learn- 
ing. We are given the observation that language is not deduced but felt. Now, 
I may be trying to compare Swiss watches with digital watches to look for 
allowances for individual learning styles in a school that is based on the one 
nature of man, but with the controversy about the giving or withholding of 
approval or disapproval, it is a question. 

In sum, the Silent Way is a multi-colored tapestry — as colorful as the 
rods — of philosophy and psychology and pedagogy. Its theory brings us back 
to the finest traditions of education. Its practice is unconventional. New. Old. 
Classical. Romantic. Silent. Paradoxes . . . 

The paradoxes I see in the Silent Way remind me of an old Russian story. 
And I don't think I do violence to the tale when I associate language with life. 
Not when I think of Socrates. ... It is the story of a rabbi who lived in a 
small Russian village. He spent his life searching for the meaning of life. He 
despaired of an answer. Then one night he had a dream. And in the dream 
he was told to go to Kiev. And in Kiev he vvould^ find a soldier, a sentinel, 
on the bridge over the river. The sentinel would have the answer for him. 
And so the rabbi sold all his belongings and saved for the trip to Kiev. And 
he travelled to Kiev. When he reached Kiev, he ran to the bridge and there he 
found a sentinel. "Thank god, you're here," he saitl. "I had a dream the other 
night. And the dream told me you would have an answer for me." The sentinel 
replied, "Oh, you should never pay attention to dreams. I had one myself the 
other mg^t and in it a rabbi came up to me asking for the answer to life. I told 
him the answer was back in his home, behind his fireplace." And so the rabbi 
rushed back to his small village and in his cottage, behind the fireplace, he 
found the answer he was looking for. 

For me, the story means that the spiritual treasure is there, with you, in 
the heart. But you have to go somewhere else, to another teacher, outside your 
tradition, to find the treasure. To find yourself, you must go to a stranger. 



The stranger, for some in our field, may be Dr. Gattegno. For others it 
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may be someone else but the important thing is that there be Socratic mid- 
wives to provoke us to know that we do not know and that we must continue 
on the thoughtful search. We are journeying. Lets continue to explore. 
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Methods Panel 
A Review of Suggestopedy 

Thomas Scovtl 

University of Pittsburgh 

One of the many reasons I find attending the annual convention of our 
organization such a delightful experience is» the opportunity to be exposed to 
the new: to meet new people and new faces from our evergrowing circle of 
professional colleagues, and, most important of all, to encounter new ideas 
and new theories, because they, and^ they alone, allow us to turn from the 
little we now know and look forward with an open but critical mind to the 
vast unknown that confronts fcach one of us. It gives me great pleasure, there- 
fore, to see that the convention chairman, Carlos York), has, in typical wisdom, 
arranged this panel as a plenary session with the specific goal of evaluating 
three of the most popular new methods of language teaching for you. ft is a 
personal delight and honor for me to be able to come literally halfway around, 
the earth, from the world s oldest continuing civilization— China, to this marvel- 
ous young infant of a city — Boston, to participate in this panel, and, if the sup- 
porters of suggestopedy ar* unfortunate enough to interpret what I am about 
to say concerning the Lozanov method as a persona! affront, they at least will 
probably be just as happy to know that at the, convention's close, I am flying 
right b&ck to the other side of the globe! 

William James once wrote that we should be tender-minded* in the face 
of new ideas, but tough-minded ir^ever accepting them. I have already alluded 
to the importance of the former: let me sketch briefly why I believe we as 
language teachers and language researchers should be tough-minded in our 
acceptance, especially. when we are given the Weighty responsibility of speak- 
ing to hundreds of our colleagues at a plenary session. It is my belief that any 
assessment of the potential contribution to our field of a new theory, method, 
or technique should begin with the direct claims (or absence of claims) that 
the author qf the innovation might make about its potential relevance to foreign 
language pedagogy. Because Georgi Lozanov, the founder of suggestopedy, 
and his followers do indeed make powerful claims about the enormous benefits 
that suggestopedy can effect on second language learning progress, it behooves 
us as fair-minded language teachers to take Lozanov seriously and give sug- 
gestopedy a thorough and attentive review. Let me summarize for you, there- 
fore, a few of the claims that have been made, and then proceed to assess them 
from both a theoretical and practical viewpoint. The information upon which 
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this brief critique is based derives from an intensive reading of the first English 
transition of Lo niov s Suggestologtf atu! Outlines of Suggestopedy, a review 
of several art'jles in populai magazines and professional journals, and attend- 
ance at a halfway session on suggestopedy given the fall of 78 at the annual 
convention of the TESL association of Ontario. Throughout this review, I have 
attempted to acply niy tough-minded experiences as an applied psycholinguist 
at the Unversity of Pittsburgh and those at my present position teaching EFL 
in northern China. 

• The claims begin with Lozanov himself. Using suggestopedy as an aid 
to the memorization of vocabulary items in foreign language classes, a teacher 
i; guarantied the following results in a 24-day course with four lessons a day — 
given at any tirre day or night and with no homework alignments: 1) 90% 
retention of a 2,0(X) word vocabulary, 2) the ability to "speak within the frame- 
work of the whole essential grammar" and 3), perhaps most astounding of all, 
to be able to read any text (Lozanov 1978:321*322). Finally, in perhaps his 
strongest claim, Lo/anov proposes that his theory has a place, not only in 
psychotherapy and pedagogy, the two fields for which it was originally con- 
ceived, but in literature, music, sports, adsertismg, and commerce. "In fact," 
he concludes with gusto, "there is no sector of public life where suggestopedy 
rould not be useful" (Lo/anov 1978:3)! 

his followers m North America have not lost any of Lozauors zeal in 
promoting what they refer to as suggestopcdia, a s)nonym for suggestopedy. 
Hordon and Schuster (1976), writing m the Journal of SuggcsHve-Accelerative 
Learning and Teaching claim that learning is 2.5 times better with suggesto- 
pedy. In fairness, this . an enormously nodest claim compared to a report 
issued h, the Suggestopedy -Research (.roup, established by Lozanov in his 
native Hi? 1 iia. which enthusiastically stated, "As seen from the tesults ob- 
tained in the experimental groups, memorisation in learning by tl uggesto- 
pedic metl.od is accelerated 25 times over that in lea ning by conventional 
methods" (Lo/anov 1978:27). When sreh results are publicized in tuc popular 
media, the impression ol a major revolution in language teaching is immediately 
intimate 1 especially when Parade magazine, with an estimated readership in 
North America of about 40 million, ran a laundatory story in ih spring of 78 
with quote*, such a t ! ie following from Dr. Gabriel Raclc, who Lad an oppor- 
tunity to publicize th Lozanov method at <» plenary panel at last vear's TESOL 
convention, 

The approach benefits the whole person. Not jmly do psychosomatic piohlems oflen 
disappear (headaches, stomach tr« i 4 )s t anxiety), hut ue see Ins personality develop, 
lie beconies more self-assured, niorv- ircatne, and expresses himself more easily 
(Miele 1978:14) 

What I found equally disturbing, howevei, about the current state of sug- 
gestopedy hi North America is not solely that unverified claims such as these 
are continually bandied about as fact, but that individuals and organizations 
who 'may or may not have a thorough, piofessional training in foreign language 
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tcaehiug arc making goixl uione\ laigely on the basis of these extravagant 
claims. As just one example, the Lo/uuov Learning Institute, estahlished in 
Washington, D.C and reputedly officially licensed to promote suggestopedy in 
the United States, was deluged with hum 11.000 inquiries about the method 
after the* Purdue article last >ear. They weie willing, in ti form letter sent to 
me at Pittsburgh, to froude a 24-hour lecture for $500 plus travel and accom- 
modations, or a 2-day vvoikshop at $1500, also plus expenses. I am dismayed 
that the TESOL organization has sometimes given unwitting support to wnat 
I feel is a blatant toinnieiciali/ation of an imp* . n method, and I ain aimost 
tempted to offer my services to travel anywhere aiound the globe to deliver a 
reasoned refutation of such pedagogical panaceas at one half the price, plus, 
of course, travel and accommodations! Let me share with you in the time that 
is left why I believe suggestopedy remains over-publici/ed and unvalidated. 

Lo/anovs hook, which introduces Ins theory and presents what he be- 
lieves to be {he empirical evidence for its effectiveness when applied^ to lan- 
guage teaching, is no lightweight monograph,^ Interspersed among its some 
077 pages are 47 tables, 53 figures- one of which includes an excerpt in Italian 
from Panted /n/. mo as a sample lesson plan (forgive the pun, but what a hell 
of a class that must he to teach!), and a superficially impressive bibliography 
from the literature of neuropsychology and psychiatry of over 600 English 
items aiid over 400 items in Bulgarian. What emerges from a casual glance at 
the text, therefore, is a weighty scientific document, replete with references to 
empirical experiments. In fact, the entire manuscript is and I choose the word 
carefully) suggestive of the notion that lozanov speaks with the authority and 
impact of scientific proof Ujifortunat !y, a carefi i reading of this major work 
reveals that then 1 is precious little m suggestology that is scientific and that 
viewed in the context of the enthusiastic claims about its success t launched by 
Lo/anov and his supporters, suggestology, at least as it is introduced by its 
founder, repiesents nothing more than an oversold package of pseudoscientific 
bunk! There are several reasons why I am led to be so sharp m my criticism; 
let # ue simply focus on one of them: Lo/anovs continual propensity to dress both 
theoretical assumptions and piactical claims with the superficial trappings of 
scientific expei mentation 

Tins is not the appropnate forum to dissect Lo/auov v . expeumental meth- 
odology. For a more detailed icvievv of Lo/dnovs work, see Scovel (1°79). Suf- 
fice it to sav, beginning with the very hist experiment reported in his manu- 
script, the serious- readei will find that time and time again, Lo/anov's experi- 
ments aie sloppilv constiucted and shoddily reported, leaving the impression 
that all the icfeieliees. tables, and figure 4 /»e constructed to present a shiny 
veneer of empirical pioof, while in icahtv, they appeal at least to me, to be 
singularlv ummpicssivc. Lo/anov nevei once bothcis to dc>ciibc the curriculum 
studied by his eontiol subjects, he never once takes the trouble to detail the 
nature of the icsts which leVeal such constant and impressive memory retention 
setfres in favor of his suggestopedic students, and he eonsir tentiy fails to explain 
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why the numbers of his subjects vary so dramatically from experiment to ex- 
periment—in one venes of investigation alone, for example, there are as many 
as 416 and as few as 7, the scores improving as the numbers decline. 

Many of Lo/anov's experiments fail to eliminate or control intervening 
variables which, in all plausibility, m i\ account for hrs test results much more 
effectively than the impact of the suggestopedic method. In corroborating the 
claim that suggestopedically nu : ton/ed material is retained for long periods 
?f time, Lo/anov reports on the results of vocabulary tests which have been 
given as long as 22 months aftei the initial learning period, again with impres- 
sive success. What one does not know, unfortunately, is whether the words 
tested were part of the common forVign language vocabulary that the students 
had been employing both formally and informally during the period of time 
after they had first learnt these words. If they were tested under the same 
experimental procedures, I would display an amazing 1007 retention rate of all 
the Thai words I even learnt in my first few weeks of Thai lessons fully 15 
years ago, not becausV of the efficacy of the quasi-audiolingual method I was 
subjected to in a h * humid, and Moisy classroom by a semi-conscious teacher 
to a particularly dull student, but because of the simple leason that I have heaW 
and spoken those 'words over and o\er again during tli£ intervening period of 
time. The problem with Lo/anov's experimentation is not simply that he em- 
ploys a different experiment paradigm, (me that is based on eastern European 
and Soviet philosophy and psychology. This particular point is raised in his 
defense by Jane Bancroft of the University of Toronto, who impresses ine as 
the most prudent and reasonable supporter of suggestopedy that I have yet 
read (Bancroft, 1978). This defense rapidly evaporates however, if -oik; r coiij~ 
pares his work to the father of Soviet psychology, Pavlo\, whose early experi- 
ments in classical conditioning are more carefully constructed and more imf p 
pressively reported than Lozano\\s disappointing attempts at scientific valida- 
tion. t 

There is also the important matter of the difference between memorization 
and learning I do not need to belabor this difference to you, my fellow teachers 
in the audience, although, I hasten to add, it is still a source of pedagogical 
confusion in contemporary Chini where most teachers • si!! 1 believe that the 
seeds of memory will ensure a rich harvest of learning. Despite the one or 
two references to the possibility that language learning coirnxises more than 
memorization of lexical items and phrasv.% the entire thrust of Lozanov's peda- 
gogical method is dirt '.ted at enabling the student to memorize large quantities 
of material in short periods of time. Even in the solitary sentence of the book 
where the author attempts to extricate himself from the error of equating 
memorization with learning, he compromises his position in the* very next 
breath. "The main aim of teaching," he writes, "is not memorization but the 
understanding and creative solution of problems." "However," he* continues, 
"the main obstacle encountered in teaching is memorization, rutomatiou, and 
the assimilation of the material presented" (Lo/anov 1978:251). I find it im- 
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possible to distinguish any substantive difference between 'main aim" and "main 
obstacle" in the preceding quote. More importantly., the innumerable references 
td experiments on memorization and the recurrent discussion* of hypermnesia, 
to the total exclusion of references to "understanding" or "creative solutions" 
convince me that suggestopedy is an attempt to enhance memory and is not 
devoted to the far more ambitious and important business of language acqui- 
sition. 

If Lo/anov's enthusiastic claims about the efficacy of his method continue 
to linger only as unsubstantiated hyperlwle, and if his principal interests remain 
focussed on hypermnesia and no* on issues much more central to language 
learning, what are we left with as tar as any possible iujw contribution from 
suggestopedy to our held'' As far as I can see, very littlt Al>out the only al- 
ternative open to us is to follow that lead of some of tl e North American pro- 
ponents of suggestopedy and borrow some of Lo/anov's more wortii,/hile tech- 
niques, integrating them m(j an eclectic method which itself is not based on 
Lozanov's theoretical approach. Such ideas as the concert session, the use of 
role plays, and the assignment of new names and identities to the students in 
the second language classroom are all useful activities and could, indeed have,, 
been inewporated inh. successful programs for both children and adults. The 
inclusion ot these procedures into a more eclectic method is not, however, 
tolerated by the founder >f suggestopedy. Lo/anov is unequivocal in his stance 
against any tampeiing with his method. "Any eclectic combination of sug- 
gestopedy with- otheTsniethods,'* he sternly warns, "brings -a risk of lower ef- 
fectiveness and* of fatigue m the students'* (Lo/anov 1978:33$).' In vifew of 
Lozanov's defensne position about accommodating suggestopedy with a broad- 
based language teaching program, I do not believe it is fair for those who, 
employ -only n%s and pieces of Lo/anov's approach to label their method of 
teaching with the same brand name as the original, untainted product. 

What then can I leave each one of you which is new and instructive apart 
from this critical iuid perhaps intemperate appr "sal of what might have ap- 
peared to be a major, innovation in language instruction? I can only offer the 
old but simple fact that there are no easy paths to follow along the toads that 
lead to greatest promise. Certainly, learning a new*IUi)guage is one of those 
■journeys with a goal of inestimable value. The fact that suggestopedy does noj 
provide any short cuts should not deter us or our students, however, from 
pursuing our common destination. Vs the Chinese so eloquently put it, °qian li 
/hi xing, sin yu /u xia' 'a journey of 1,000 miles begins with the first step." 
Lvt us take each step carefullv, but with confidence Godspeed to each. of you 
on that exciting journey! 



ERLC 



on mot n 



Methods Panel 

* 

Exploring Community Language Learning 

Barry P. Taylor 

University of Pennsylvania 

Currans Counseling-Leaming Community Language Learning (C-L/CLL) 
model (1981, 1972, 1978) is seriously misunderstood. It shoi *je made clear 
from the outlet that Curran s writings do not suggest that we revise or adapt 
our approach to education; rather, they advocate the adoption of a fundamen- 
tally different framework, one that is difficult to perceive if we approach it with 
a "questioning" rather than "questing" attitude (1972). 

Understanding Curran s approach requires an openness to the totality of * 
his perspective (Rardin, personal communication). One who cannot view it 
in this way may well feel, as I did when I began probing into the core of 
C-L/CLL, that it is a cultish, closed society. I found myself put off by its ap- 
parent lack of acknowledged limitations, its claims of success in the absence o r 
rigorous statistical validation, and its lack of attention to the findings of lin- 
guistic and language acquisition research. 

These criticisms may well be valid from ou^ rigorously scientific point of 
view, but before we judge, we should try to catch the essence of Curran's rela- 
tively new, humanistically-based perspective. Once we have experienced it on 
its own, terms, we will be in a much better position to evaluate it fairly. 

1. A brief overview / 

1. CounselinfrLearning. The theor^icaljunderpinnings of the C-L model 
are complex. In brief, Curran maintains trnrf learners approach education with* 
personal learning conflicts, hostilities, and anxieties which can block intellectual 
learning. By incorporating the "whole person" of the learner jx&o the educa- 
tional process through an application of techniques gleaned f/om counseling 
psychology, especially "client-centered therapy" (Rogers 1985), the C-L model 
offers a means by which teachers and Jearners can deal productively with these 
feelings (Rardin 1976). 

The C-L model places heavy emphasis on the power of group, psychological 
forces. LaForge ( 1^971:48«49) suggested that' for most traditional classroom 
situations, "the assumption seems to be that learning is an individual affair 
somewhat accidentally taking place in a group situation." In group learning, 
however, LaForge suggests, emotional* support and acceptance are supplied by 
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the group to each student. Individual defensiveness is reduced in this less com- 
petitive, less rejeciive, more secure tmosphere. 

The result -of Curran's research in utilizing counseling strangles to deli 
with the anxiety and fear experienced by many adult language ieimers led 
to the formulation of CLL. * 

2. Community Language Learning. It has been claimed that traditional 
language teaching approaches often produced studf-nh who had a good com- 
mand of the "rules" of the target language but were insecure and unable to 
communicate (Curran 1976) GLL's emphasis on making the target language 
immediately operational is, to a large extent, a reaction against these o,der ap- 
proaches. 

Today, however, language classes are fundamentally different from those 
of the past. The current trend toward learner-centered, context jally-real ac- 
tivities designed to involve the learners actively is paralled'to many olKthe 
techniques used in the CLL class. But one who only notices the surface ciiar- 
acteristics lias, missed the essence of CLL because its core does nof reside in 
the activities or techniques themselves. 

CLL is not a method of language teaching — it is an approach ( Rardin, per- * 
sonal communication). As vuch, its focus is not on prescribing specific language 
teaching activities but on creating an entirely new approach to education— * 
one in which the teacher gives up his sense of power and authority in *he 
classroom in favor of entering into the world of the students. The teachers role 
is to facilitate a feeding of security and self-esteem ^within his students which 
will enable them to rely confidently on him and each other, without feeling 
foofeh in. their ignorance; as they progress from a state of linguistic dependence 
to independence. The ernp hasis js on the ftqjtensely personal growth of each in- 
dividual in th& group as ,he*t?nters into a "creative affiliation Rardin, personal 
comrauni cation) with the teaclier. This approach inquires, a fundamental 
change in the teacher s perspective as he gives^up his role of "answer man— 
fcnower of all/ and- adopts the cqunsefhtg ikills anfl sensitivities which will 
alio*; him to relate to the students from their perspective arid become an "un- 
derstanding counselor" who fully appreciates their struggles.* 

My CLL experience in Japanese illustrated many of these precepts. Be- 
cause the classes not "teacher-centred. "*we took the responsibility to initiate 
much of our own learning. ,THe result was that we were never |pressu red; as 
learners, we decided when it w^Jiny* to move ahead or wheft we needed 
"space." One* of the most striking 'ofWrvatnms T om this language experience 
was how much Japanese we learned fn oniy te lours. The momentum, once 
started, progressed at an incredibly rapid pace. Perhaps the most surprising 
observation of all was the extent to which ^is dass of strangeis gave, up some* 
of themselves to support each o.lier, even the slowest "student, and create a 
sense of community. ^ . . 

\ From these experiences. I feel that this approach certainly has value and 
much to offer the language Caching profession. But, to quote Brown (1977: 
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371), . . We would do well to attend to the possible tempering factors 
which I would now like to address. 

, " J2. Questing , 

Over the past few years there has been an abundance of research to sug- 

~ gest the enorrnous and variable role affect plays in second language acquisi- 
tion J^rown 1973). Scovel (1978) recently discussed the difficulties entailed in 
defining "affective variables and opted in favor of the broader term learner 
variables," of which, affect is just one. Learner variables range from physical 
emotional to perceptual to cognitive differences among learners. Anxiety, whp 
Scovel categorizes as just one affective variable among many, is one oythe 
rnajor targets of the C-L/CLL approach. 

Like most learner variables, anxiety affects different learners in different 
ways. In his article, Scovel cites research to indicate that sometimes even Anxiety 
can be facilitating rather than debilitating. Given the wide range of possible 
differences among learners, we must be wary about ascribing tjjo-moch im- 
portance to any one variable or set of variables within the context^ a teach- 
ing approach; it is unrealistic to assume that one approach will work equally 
well with all learners (Scovel 1978, Brown 1977). 

While it is generally recognized that students learn better when the teacher 
is warm and supportive (Hanchey 1976, Scovel 1978, Stevick 1973), we must 
I , also realize that students can sometimes create or carry their own learning 
blocks, regardless of what we do. In a report Evaluating the effectiveness of 
the CL.L approach in training Peace Cprps volunteers in the target-country 
setting, Rardin (1975) discovered that the combined variables of time pres- 
sure, diffi'mlties in adjusting to the foreign culKire, individual learning styles, 
K'id levels of motivation and expectation created a resistance to the CLL ap- 
proach on the part of some learners. These trainees, rather than seeing CLL as 
a relaxing way to fe£|yi* language, found tj>emselves more comfortable wjth the 
*- more traditional method?. ^ " 

Th«re are also mixed reactions to the student-directed aspects of CLLr 
Since the focus is on student-initiated conversations and gaming, linguistic forms 
are not presented in a predetermined order, but only a* tjieir need arises. Al- 

x , though most of the ■ learning is inductive, occasionally deductive teaching is 
don^ by the teacher or one of the students (Rardin, personal comrntlnication). 
While the teacher is free to suggest or recommend any* hiaterials that he feels 
are appropriate, the students are given the responsibility to become personally 
involved so as to foster the self-investment which is presumed to maintain in- 
terest and motivation (Curran 1976). 

Although this personal freedom was one of the most exhilarating aspects 
of my CLL Japanese experience, we must realize that my class was composed 
entirely of language teachers and, as such, we were well-equipped to direct our 
own learning. One wonders how classes with different kinds of students would 
proceed. Would the classes be like those Brown ( 1977*370) described when 
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# * 
he wrote, "the initial stages of this process are grueling — there are days and 
weeks of .struggle and sometimes intense confusion until some insights begin 
* to be drawn by the learners"? How long woGld these students stay interested? 
How long would the faster learner* encourage and support the slower learners 
and still feel that they themselves were learning!* 

Although our group entered the CLL experience fully recognizing that- 
it was only an extended demonstration, students under personal or external 
pressures to succeed >eem to react differently. The resistance of some of the 
Peace Corps trainees referred to earlier extended to the non-directed aspects of 
CLL as well. An objective k report by a Peace Corps language specialist (Han- 
chey 1978:3) indicates that some of the new trainees found the approach frus- 
trating. They issued "a plea for a return to the imposition of planned curricula 
and away from the freedom' of the new approaches." 

And what of foreign ESL students in the United States? Are they more 
like my fellow CLL participants or like the Peace Corps trainees? My gue*« is 
that these ESL students would suffer from the same anxieties and pres.su* s 
the Peace Corps volunteers. The pressure to "pass" TOEFL, to be accepted at 
a college or university, to adjust tc a different culture, not to mention a dif- 
ferent language, and all m a short period of time, could contribute to impa- 
tience and quite possibly anxiety, hostility, or even culture shock. As indicated 
earlier, students carrying this kijid of internal anxiety appear to be resistant to 
the CLL approach. Could it be that these learners, obviously instrumen tally 
motivated, find the integratively motivated CLL approach incompatible with 
their perceived needs? 

Leaving these considerations aside, is the CLL approach mechanically ap- 
plicable for ESL classes composed of students with heterogeneous linguistic 
backgrounds? OrtT of tfie ways that CLL builds security is by using the stu 
dents* native language in class, at least at the early stages (Curran 1972, 1976). 
Although it has been pointed qut (Rardin, personal communication) that a, 
single monolingual teacher can utilize similar techniques, using only the target 
language, in a multi-lingual class, I have never actually seen such a demon- 
stration. It would appear that even if such a te<h:iique could function effec- 
tively, the security of using what is known and familiar (i.e., the native lan- 
guage ) would be lost. The piifod of reflection, which is an integral component 
of the CLL approach in that it offers sttidents the opportunity to express their 
feelings, reactions, and concerns (LaForge 1971:55), would also lose its impact, 
especially at the leaver levels where security is so important, if the students 
were unable to rely on their native language^). The alternative would be the 
prohibitively expensive solution of providing native speaker counselors for each 
language represented in the class 'Brown 1977). 

And what , -of the language teachers themselves^ As Brown (1977) has 
noted, CLL requires teachers well trained not only in linguistics, so that they 
could work comfortably without a text, but also in translation and counseling 
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skills, addition, these teachers would need to he native-like, if not native, 
in the ferget language as well as highly proficient in the- students* native lan- 
guage^). And where would these teachers receive their training, since most 
M.A. programs do not ofFer HI courses of\study in C-L/CLL? 

These questions are not intended to detract from the enormously valuable 
contribution which C-L CLL has made to the language teaching profession. 
While it may not be a panacea for all of our problems, C-L has dealt straight- 
forwardly with many of the central issues facing us. It has questioned the role 
which we plav in our classrooms and has proposed revolutionary changes which 
are well worth considering. It has made us aware once again of our students 
as people — those individuals who so often seem to be forgotten in our concern 
with new techniques and statistical research findings. It has takeJv.some of the 
research in learner variables and made it operational in the classroom. But per- 
haps most importantly, it has forced us to iook at ourselves and evaluate what 
we do. This is a challenge that must be tal en seriously. 
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Evaluating, Adapting and Innovating 
Language Teaching Materials 

Chriftophtr N. Candtin and^Mifcka*! P. Sreen 

University of Lancaster 

This article has two aims: first, *o offer a comprehensive set of questions which 
can be used in teachers' evaluations of the materials with which they work; 
second, to offer a number of proposals which may be used as a basis both for 
the adaptation of those materials and for the design of new materials. The 
proposals for adaptation and innovation are based upon the growing recogni- 
tion that teaching materials ought to confront the learner with examples of and 
opportunities for language use. The proposals are therefore intended to serve 
those teachers who wish to enable their learners to learn to communicate. 1 

In this article the term materials refers to any published or unpublished 
data in any medium or collection of media used for the purpose of language 
teaching and learning. We intend that the guidelines for the evaluation of 
materials can apply to single items as well as sets of materials. 

The guidelines for evaluation and the proposals for adaptation and innova- 
tion focus on f mr main issues: 1) In what ways should materials be appropriate 
to the teaching situation and to the curriculum in which they are used? 2) What 
should be the content of the materials? 3) What teaching-learning activities 
should the materials promote? 4) What new directions are available for ma- 
terials design? Each section of this article takes one u these issues and discusses 
a range of questions which arise. 

1. Putting Materials in their Place 

Language teaching materials need to be seen in their proper context. First, 
we need to know to what extent materials are sensitive to the teaching situation 
in which they are to be used. Second, we need to be sure that resource materials 
are in fact appropriate to the particular language teaching curriculum. We can 
specify each of these requirements more precisely. 

LI Are the materials sensitive to situation? Many current materials seem to 
give the impression of ideal usefulness and applicability for any learner — at 
some assumed level of proficiency — in any place in the world. Such materials 
are not so much ideal as idealised, they do not take account of situation. Unless 
they are adapted to specific situations in some significant way they cease to 



1 The actual criteria on which the proposals are based derive from a definition of language 
as communication and a view of language learning as a communicative process. A detailed 
consideration of these criteria and their theoretical and research background is given in M P. 
Breen and C. N. Candlin ( forthcoming). 
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be of any direct value to anyone anywhere. Such global applicability almost 
always involves compromise when the materials are actually used, compromise 
which attempts to translate the ideal into the actual situation. In fact, there can 
be no such thing as both ideal and universally applicable language materials. 
Even when materials designers have a particular situation in mind, they must 
recognize that their materials will always undergo processes of reinterpretation 
by the, different users of the materials as teachers, learners, and classrooms im- 
pose their specific meanings on the materials. The om> alternative to this 
reinterpretation process is when the materials come to prescribe the teaching 
and learning. Teachers can be driven by the materials, so the efforts of the 
learners to learn and use the language is subordinated to the specific require- 
ments and conventions of the materials. In such- cases, learners are obliged to 
suspend their own sense of reality and teachers are obliged to instill interest 
and involvement in something over which neither teacher nor learner has much 
possibility of control. 

In what ways should materials be sensitive to situation? We are aware, 
first, that every language teaching situation reflects certain socio-culturally 
determined educational aims and philosophies; language teaching is supposed 
to serve the individual within the context of the state. We know, too, that any 
language teaching situation has its own specific human resources, its particular 
teachers and learners. Third, it exhibits its own material resources; its own 
financing, its facilities, and the time invested in its teaching and learning. Each 
of these characteristic constraints imposes its own demands on teaching and 
learning and, in particular, on the materials to be used. 

T'/ what extent as far as the matter of human resources are the materials 
appropriate to the teachers and the learners? We must remember materials 
producers are in the business of making materials attractive to teachers, not to 
learners, so materials producers might suspect that teachers will choose materials 
which match their own experiences and preconceptions of language learn- 
ing. Materials must therefore look familiar. To what extent do the materials 
constrain rather than exploit the teachers own competence as a teacher? Do 
they reflect or contradict the teachers theories or beliefs about teaching and 
learning? Do they match the teachers preferred methodology and open up 
new directions for it v Dots the view of language implicit in the materials con- 
flict with or complement that of the teacher? 

Have the matciials been learner-testcd v Have they Ix'cn evaluated by those 
learners who use them' Aie f *ieie elcai guidelines for learueis on how the niate- 
nals are to be used? If the materials have been leainei -tested, and if they guide 
the learner in their use, which group of leameis weie the mateiials planned for 
and evaluated 1>\ Often enough, learners aic made to match materials rather 
than the reveise 

What other lesourccs aic assumed by the materials? Do they call for special 
classrooms and special timetables J Do ti» \ lequiie media resources beyond the 
capacity of the institution to provide? What is the relative cost of the materials 
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compared to its actual cost to the student? What is their cost relative to their 
life in cost-benefit and survival terms* There may be greater benefit in smaller 
quantities of varied materials than in classroom sets of one particular course. 

Of course, materials share with all the other resources in language teaching 
the potential to overcome obstacles, but to do so their user must at least begin 
with a sensitivity to that situation. We have seen that materials themselves 
can become a constraint. Materials pose two basic questions: Is it the teaching- 
learning situation which needs to be changed in order to accommodate the 
materials? Or do the materials need to be replaced by others which will more 
adequately serve the existing situation? 

12 Are the materials appropriate to the particular curaculum? Any language 
leaching curriculum answers three interrelated questions: What are the teach- 
ing-learning objectives of the course? How are these objectives to be achieved? 
To what extent have the objectives been a v hieved and have they been appro- 
priate? So, a curriculum can be defined as an ongoing relationship between 
some specified purposes, the methodology of which serves as the means towards 
those purposes, and the evaluation procedures which inform us on the effec^ye- 
ness of the methodology and the appropriateness of our original purposes. Ma- 
terials need to be contextual ised within this curriculum process, they should be 
st <?n as one resource within it and not the main driving-force of the curriculum 
ltse'f. it is often the case, however, that materials will supply their own purposes, 
their own methodology, and they can provide criteria by which to assess learn- 
ers progress, judgj our own methodology, and reconsider the suitability of our 
own original purposes. So, a harmony is needed between the hidden curriculum 
of the materials we use and the actual curriculum within which we work. 

This article primarily explores the possibility of a positive, harmonious 
relationship between materials and the language teaching curriculum. First, 
though, we shall focns on the basic curriculum characteristics which can help 
materials to avoid the unrealistic ideal. Current interest in so-called functional 
language teaching offers an illustration of the problems of idealisation of ma- 
terials. 

If we examine purposes in such functional materials, we find first that 
the purposes they explicitly state are opposed to the purposes they actually 
imply. Second, the change in purposes they advocate is to be accomplished 
without parallel changes in methodology. The explicit purpose of these func- 
tional materials is to teach language as use by means of exposing learners to 
a sequence of langv .ge functions. Implicitly, however, the definition of such 
functions given in the materials is not a genuine view of language as use, where 
an essential characteristic is a variable and unpredictable relationship between 
function and form. In such functional materials, the functions are presented 
as if they could be static, separable and categori/able items, and are generally 
exemplified in some pre-selected sentences or prescripted dialogue. As such 
they lack the genuine contcxtualisation of real communication and have no 
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appropriate function other than as models. Thus, two myths are perpetuated: 
first, that there can be some predictable relationship, out of context, between a 
function and those forms to which it is arbitrarily yoked. Second, that functions 
are an established and systematic framework upon which the teaching of form 
can ,be based. So, the teaching of functions is erroneously confused with the 
teaching of the formal or structural nature of language. This apparent change 
»n purposes^ a sham, such materials offer little or no change in methodology. 
Essentially the same methodology which once served the teaching and learn- 
ing of language structure is assumed or proposed within many functional ma- 
terials. Apart from tasks and cxerc.ses involving idealised data designed to 
S»ve teachers and learners the impression they are dealing with language use- 
such as written dialogues and other unreal and often paradoxical texts— these 
materials imply that functions can bo presented in some valid sequence, learned 
as separable and ^stable items based on consistent patterns, and can be 
memorized like the phrase-book phrases they most obviously resemble.- 

Given the warning of this current example, what are the ways in which 
materials should be appropriate to the curriculum they serve? What are the 
questions that a teacher should ask of materials? 

1.3 Are the truiteriais appropriate to purposes? Here our first concern is with 
the specific target language, /and secondly \vi ♦he important characteristics of 
the learners who are taking4he course. 

14 Appropriate to Target TMngitage? Is the language of the materials derived 
from sociolinguistic analysis of the target performance repertoire? 3 

\This question implies that the data within the materials should be authentic 
to the target, they should come from native speakers and be intended for 
native speakers. We can also ask, however, what, this target performance re- 
pertoire represents or is dependent on. Examples of target performance— what 
the native speaker «vould say in this or that event—are particular uses of the 
target community's conventions governing meanings or ideas, the conventions 
governing how these meanings should be shared and negotiated in interper- 
sonal behaviour, and the conventions which govern language form, or the text, 
which realises such meanings and such behaviour, target performance, there- 
fore, manifests the native speaker's knowledge of conventions which govern 
communication I eh example of target performance represents a meeting-point 

-A comparison of traditional structural-situation.*! matenals with more modern functional 
materials using the guidelines offered in sections 2 & 3 of this article will reveal these and 
other significant similarities There are other misleading as.sertions and assumptions. One of 
these is that the communwatnc isc of language equals a set of functions and that it is to he 
achieved primarily ( and on occasion exclusively ) through oral-aural practice. We have to 
deduce . naT communication doe r not hike place between writers and readers. 

1 A per c onnance repertoire refers to selective use of language which any curriculum 
takes as its purpose, however extensive and general or restricted and specific such a repertoire 
may Ik*. Even native speakers control' only a .selec tion of varieties of a language, and Mich a 
repertoire will reflect the communicative performance experience, and demands of that speaker 
Language teaching can be seen to he concerned with such target repertoires in the socio- 
linguistic sense (Cumper7 1984) rather than with some global or ideal target language, 
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of these three knowledge systems: ideational knowledge, interpersonal knowl- 
edge, and textual knowledge. 4 

Examples of target performance also involve the communicative abilities of 
the users of the language. Communicating, either written or spoken, calls upon 
the ability to interpret, express, and negotiate meanings. The skills, either taken 
together or individually, depend upon these three crucial and underlying 
abilities. Reading, writing, listening and speaking are the means through which 
the abilities to interpret, express and negotiate act and become refined. 

The distinction we have just drawn between the specific target performance 
repertoire and its underlying target competence leads us to replace the original 
question with the following: 

1) Are the materials authentic to the target performance repertoire? 

2) Are the materials authentic to the underlying knowledge and abilities which 
the target performance repertoire represents? 

If -authenticity to the target. is a criterion we wish to impose on m terials, then 
1) would demand actual data from the specific target repertoire which is the 
purupse of the curriculum; 2) on the other hand, would demand any authentic 
data which themselves represent or draw upon those underlying knowledge 
systems and abilities which underlie the target in question, but which could 
in principle underlie a range of different target performance repertoires. Al- 
though different target repertoires of different curricula may well share com- 
mon features, there is likely to be even more overlap between the target com- 
petencies which underlie them. So, the second question would imply that there 
need be no one-t )-one relationship between the data in the materials and the 
actual target performance repertoire. Authenticity to target in terms of this ques- 
tion implies authenticity to the native speakers communicative knowledge and 
abilities. 

The distinction between (1) and (2) above leads to a third question: 

3) To what extent do the materials need to be authentic to the target, in either 
sense of the term? 

Materials can be seen as a means to the target rather than necessarily embody- 
ing the target itself. Can materials serve the process of teaching and learning 
rather than the product of teaching and learning? If so, then materials should 
serve as a link between learner and target and to do this they will have to draw 
on both the target and what the learner brings initially to the course, in order 
to exploit the potential relationships between the two. Here we require authen- 
ticity to both target and I nx Now the question becomes one of discovering 
what materials can draw on possible relationships of the two. They ran either 
build on these possible ■ "lationships between the learner s initial repertoire and 
the target repertoire, or they can mediate between his initial competence — his 
' communicative knowledge and abilities— and the target competence. Small 

4 IThesT terms derive from HiilluLiy ( 1973) For further discussion of these knowedge 
systems and their implications for the language teaching curriculum, see M. P. Breer and C. 
N Candhn (forthcoming). 
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overlap amo:g competences will most likely be greater than that between 
repertoires it makes sense for materials to be primarily concerned with develop- 
ing the learners underlying knowledge and abilities* In short, to focus on the 
learners process competence, and be less directly devoted to target performance 
data. Here for us lies the possibility of distinguishing Process Materials from 
Product Materials. 

We have, as a result, several criteria we can call on when considering how 
materials might he appropriate to target language: 1) authenticity to target 
performance repertoire, 2) authenticity to the knowledge and abilities under- 
lying such a repertoire, authenticity to target competence, or 3) seeing materials 
as a means to the ultimate purposes, as serving the process of teaching and 
learning and representing a link between learner and target. These criteria are 
not necessarily exclusive: We may well demand that materials meet all three 
criteria to some degree. That degree would depend on answers to other kinds 
of questions we may ask of materials. One such question will be: in what ways 
should materials be appropriate to the learner who is entering the course? 
1.5 Appropriate to Initial Learner Characteristics? It is an important truism 
that materials should go from learner to target. Whatever the purposes of the 
curriculum we need to consider therefore what the learner cs l initially con- 
tribute to those purposes and what will be the learners own expectations of 
the curriculum. We do this for three reasons: first, to avoid idealisation of the 
curriculum and its materials, far removed from the actual learners; second, to 
exploit those potential contributions of learners which will be appropriate in 
various ways to the achievement of the target, third, to acount for likely dif- 
ferences between the material designer's view of the purposes and the individual 
learners definitions. Such contributions and expectations will, of course, vary 
between learners and, significantly, vary within the learner as the learning 
progresses. Purposes of materials will need to accommodate this heterogeneity 
and variability in learner contributions and expectations if they are to link 
learner and target. In assessing the appropriateness of materials in relation to 
these learner characteristics, we can ask: 1) Is account taken of the learners 
initial repertoire? (e.g. aspects of the mother tongue, aspects of the target re- 
pertoire the learner may imperfectly know etc.) To what extent? In what ways? 
2) Is account taken of the learners initial competence (initial communicative 
knowledge and abilities)? To what extent? In what ways? 3) Is account taken 
of relevant personal characteristics of the learner (e.g., age, sex, social and 
cultural identity etc.)? To what extent? In what ways? 4) Do the materials 
accommodate the learner s expectations of what language learning requires and 
the learners expectations of what the target requires? (These expectations may 
stem from the learners previous educational experience, and they are powerful 
factors in the learners definition ot the learning task) 5) Do the materials 
allow for the learner's own definition of learning needs? Are the learner s own 
specific interests and motivation accommodated? Not only are needs, interests 
atftl motivations distinct, they both vary among learners and over time as the 
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learning progresses, How do the materials cater for this differentiation and 
variation? 

In orders then, to reflect the particular purposes of the curriculum, materials 
must account for^he specific target language and for specific learner character- 
istics. Target langu^fccan be viewed in terms of a performance repertoire or 
an underlying competence. Learner characteristics can be viewed in terms of 
particular contributions and expectations. Purposes in any curriculum, and the 
materials within it derive from these two main sources. Materials need to reflect 
both if they are to avoid idealisation. The role of Methodology is to draw these 
two sources together to enable the learner possessed of initial and changing 
characteristics to come to terms with the target. 

1.6 Are the materials appropriate to Methodology? This is the central concern 
of this article. We have already argued that, in considering the question of 
authenticity to target, materials can be identified more directly with the teach- 
ing-learning process than with the product of teaching and learning. This argu- 
ment importantly implies that the evaluation, adaptation and innovation of 
language teaching materials must be based primarily on methodological criteria. 
Our emphasis on the appropriateness of materials to initial learner character- 
istics further implies that materials should start from the learner aiid serve as 
a link between the learner and the target Also, the idealisation of many ma- 
terials stems from inappropriate purposes and a methodology which is insensi- 
tive to the process of teaching and learning since such materials frequently offer 
model target data (often idealised in itself) as the driving-force of the cur- 
riculum, requiring learners to act upon this product through imitation and 
practice. The process of teaching and learning, involving the development and 
refinement of the learning, is paradoxically absent from the materials themselves. 

What methodological criteria within any curriculum should materials re- 
flect? We need first to see the merhodology of the curriculum in terms of two 
interrelated elements: content and process. These are both exploited by method- 
ology as the dual means to the curriculum purposes. Content in methodology 
is the link between the content of the target language and that content which 
the learner brings to the learning. We can additionally see content as that which 
the teaching-learning process works upon. In this way, content is the servant 
of the process within methodology. Process is what teachers and learners under- 
take during language learning and it represents the language teaching cur- 
riculum in action in the classroom. - 

We can identify the methodological criteria for materials within a set of 
principles for the selection and organization of content on the one hand, and, 
on the* other, in a set of principles for the design and selection of teaching- 
learning activities as representative of the process, Our point here is that ma- 
terials will be appropiiate to any language teaching curriculum to the extent 
that they serve as resource within the methodology of that curriculum. Mate- 
rials which are primarily driven by purposes idealise themselves away from 
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the language teaching process; as such, they cannot justifiably be seen as lan- 
guage teaching materials. 

1.7 Are the materials appropriate to Evaluation? Every curriculum adopts its 
own Evaluation procedures. Materials can serve as a possible source of evalua- 
tive criteria within the curriculum, both by focussing on the evaluation of 
learner progress and on the curriculum itself, in terms of assessing the effective- 
ness of methodology and the appropriateness of the original purposes. 5 

Evaluation can be either mi initiative (or end-of-eourse) or formative (on- 
going and guiding), and materials can be exploited as a means of evaluation k 
in both these roles. What is important is that the selected evaluation procedures 
of the particular curriculum must determine the ways in which, and the extent 
to which, the materials aid evaluation. The alternative to such assessment is to 
continue with evaluation-driven materials which are idealised from the process 
of teaching and learning in the same way as purpose-driven materials. Teachers 
are well enough aware how those materials which derive trom externally stan- 
dardised and imposed tests and examinations come to control and prescribe 
every aspect of the curriculum. 

Jf we now ask the question of the ways and the extent teachers might wish 
materials to aid their evaluation procedures, we f.re led tc consider several re- 
lated problems. Are there criteria within the materials useful as a basis for 
evaluating learner performance? Are such criteria, present, sufficiently explicit 
to different learners so that the materials can act as a stimulus for learner self- 
evaluation and for evaluation by the learners of themselves as a group? This 
latter requirement would make materials contain criteria which can be differ- 
entially met by different learners. Materials in such a case would not assume any 
single' or limited set of criteria for success, but allow for differential success. To 
what extent and in what ways do the materials offer feedback to the learner? Do 
they give clear guidelines on the direction learners should take after success 
or failure at a task? Are such guidelines and such feedback premised upon 
predetermined criteria, or do they allow for different learners achieving different 
tasks, in different ways and at different tin.es? SI /hi Id the materials provide the 
means for summative and or formative evaluation? Can learners apply their 
own evaluation to the contents arid activities of the materials? If we accept the 
significance of personal sock -cultural attitudes in language leaniing, and the 
importance of learners' value and emotions, then this becomes an important 
question for materials designers. We are aware that learning to communicate 
is a highly evaluative activity. Requiring learners to confront language use as 
if it was devoid of such affective characteristics is a form of idealisation. This 
is especially so if we wish learners to draw upon their initial knowledge and 
experience in language use, with its strong affective involvement. We could 
then demand of materia' s that they be open to learnei evaluation by continually 
involving the learner s attitudes, values and ''motions. Indeed, we might expect 

5 The present article can he seen as an example ot such curriculum evaluation m that it 
foeusses on one resource whuh, as we have said, A houltl nurroi the whole curriculum. 
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the materials to stimulate learner judgments about content anc to provide 
activities which are characterized by strong attitndinal, evaluative and emotive 
potential. An outcome of such learner evaluation, of course, is an evaluation 
of the materials in terms of their own appropriateness to the learner's needs, 
interests and motivations in learning the language, and their effectiveness in 
helping the learner to learn. Since learners do evaluate the usefulness of mate- 
rials against a range of socio-cultural and personal criteria, perhaps materials 
should exploit this exploitation, rather than assuming that they can be value- 
free and infallible. Such exploitation would acknowledge the strongly affective* 
character of language in use and language learning, and ought therefore to be 
seen as a strength rather than as an admission of limitations. 

2. Proposals 

We have argued above that materials which are sensitive to the .situation 
of teaching andjearning, and whi ch are a ppropr iate to_ the. particular language 
teaching curriculum, will be likely to avoid idealisation. We put forward pro- 
posals in this section directed against such idealisation and towards an ap- 
preciation of language learning and teaching as a communicative process. 

2.1 Materials serve MethothTogy. Language teaching materials should primarily 
serve as a resource within methodology, and therefore the specific purposes or 
evaluation procedures of a curriculum are secondary concerns in materials 
design. Materials will help teachers and learners by incorporating content as a 
carrier of the teaching-It .ming ficeess and by promoting activities and tasks 
which activate this process (see Section 1.6 above). Materials can act as a 
means to some specific purposes and as a means for evaluation, but they will 
do so for other than purely methodological reasons (e.g., they will contain 
content which derives from learner definitions of their longer term needs, in- 
terests and motivations). Materials design will be based on process criteria of 
teaching and learning rather than on product criteria (see Sections 2 and 3 
which follow for a discussion of such criteria). 

2.2 Authenticity is relative. There are three routes to authenticity in materials: 
1) authentic data taken from the target performance repertoire (commu ucatiou 
by native speakers for native speakers in the target community): °<) authentic 
to target competence in that it derives from the knowledge systems and com- 
municative abilities underlying the particular target performance repertoire or 
a range of such repertoires: and 3) authentic to the process competencies of 
different learners during the learning-taking process competence to refer to 
underlying knowledge and abilities but those which the learner develops from 
initial competence during the process of teaching and learning. 0 

In materials design, therefoie, we would propose: 1) that authenticity to 
the target repertoire is neither necessary nor sufficient foi tn< * teaching-learning 

fl To these three we could add Widdowson's 'authentication' (Widdowson 1978) by which 
learners 'make authentic* subjectively data which are presented to them Thus data can be 
authentic to a text-type (say a poem) but if it is not read as a noewi. but as a source of 
grammatical information, it is not treated authentically. 
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process; 2) that authenticity to target competence may be necessary but hot 
sufficient for the teaching learning process, 3) that authenticity to different 
learners' process competencies is necessary but may not be sufficient for the 
teaching-learning process, and 4) that authenticity to both target competence 
and different learners' process competences is both necessary and sufficient. 
(Note that the involvement of target competence within materials is justified 
by the Purposes of the curriculum, not its Methodology. 

2.3 Materials will allow for Differentiation. If we accept that authenticity to 
the learner within the teaching-learning process is a valid criterion in materials 
design, then differentiated materials are the necessary outcome. We would argue 
that difTerentiation needs to be accounted for in the following ways: 1) in terms 
of selection^ and organisation of content and the type and range of activities and 
tasks; 2) in terms of particular routes available to learners through both con- 
tents and activities- 3) in terms of particular pace or rates of learner progress; 
in terms of the particular medi » through which learners (as individuals or as 
members of sub-groups) act upon data; 5) in terms of the learner strategies 
(differential use of abilities) exploited variously by different learners even in 
the undertaking of common or shared tasks; and 6) in terms of the criteria 
applied by learners to the evaluation of their relative success or failure at par- 
ticular tasks or undertakings (i.e., a range of possible formative criteria which 
learners may use to establish their own individual or group criteria for success 
or failure). 

2.4 Materials can be a Product of the Process. Designers and teachers often see 
materials as a before-the-event necessity. Most often materials are seen as input 
for teaching and learning. Frequently this leads to unquestioned dependency 
on such materials, regardless of their effect. As an alternative, it is possible 
to see materials as emerging from teaching and learning, as ongoing achieve- 
ments of the process. If we can accept the proposals above on authenticity and 
differentiation, then learner-created materials are a likely outcome. This out- 
come has another justification in terms of classroom management; materials 
created jointly by learners and teacher during the learning will act to reduce 
the obligation on the teacher continually to search for new and ready-made 
materials. Materials which are the product of one course or one activity within 
that course can be recycled later within that same course or within the process 
of another course. So, if we see materials as a resource within the teaching- 
learning process, we may also see them as a resource created by that process. 7 

3. What the Contents Page Might Not Reveal 

We have been arguing that most language teaching materials emphasize 
content and take process for granted. Content has been seen as the si bstance 
of teaching and learning ard in consequence methodology has acted as an 

7 It is dear also that any process has* within it a series of tangible (and in principle 
evatttatable) products, open to inspection by the learners and the teacher, which drive the 
process. It is, after all, by being sensitive to thest that the process changes direction. 
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inculcator of content, essentially as a device for force-feeding inventories of 
formal or functional items into the learner. Process was assumed by and sub- 
servient to content. As an alternative we have proposed that the methodology 
of a curriculum would not separate content from process and make the latter 
dependent on the former but, on the contrary, content would be made the 
servant of the teaching-learning process and acted on by that teaching and 
learning. Within methodology, content and process continue to be related and 
continue to help and influence each other, but content now adopts the role of 
carrier of the learner's process competence and the provider of data through 
which learners select and develop their own means towards the target com- 
petence. 

A chosen methodology in a curriculum can, as a result, provide criteria for 
twb linked tasks. It can offer criteria within the set of principles for the selec- 
tion and organization of content, the choice of which will be defined by the 
process being served. In addition, methodology can offer further criferia within 
a set of principles for determining the type and range of the teaching-learning 
activities which will work upon the content. It should be clear, however, that 
in most materials the principles which have been applied an^ the criteria used 
in their design are likely to be implicit rather than explicit, and in some cases, 
confused and inconsistent. In evaluating materials, therefore, we need to iden- 
tify these implicit as well explicit criteria, an< in adaptation and innovation 
we should strive to be consistent to the chosen criteria. 

This section examines some principles underlying the selection organiza- 
tion of Content; Section 4 will go on to examine those underlying the activities 
promoted by materials. 

3.1 What is the implicit theory of Language in the materials? Is the view of 
language that of a structured system of formal items— grammatical, lexical and 
phonological — much as in an analytical grammar? Is it a view which sees lan- 
guage in terms of functions and the uses to which these formal items are put, 
and therefore still a view of some finite set of items to be accumulatively 
learned? Is it, on the other hand, a view of language as communication, charac- 

* terized by variable relationships between form and function: «a view which 
sees language in terms of a dynamic process of the sharing and negotiating of 
meanings through the shared and recreated coin rations governing particular 
communicative performances? 

3.2 What is the Focus of the Content of the materials? What is the "content" 
of the content? We should distinguish here between the explicit focus and the 
implicit focus of the content. The author of the materials may adopt a par- 
ticular theme or a series ot specific situations as the explicit content, but the 
important implicit content may well he a sequence of categorized grammatical 
patterns or functional items. 

3.3 How are the Contents Sequenced in the materials? What seem to be the 
criteria for sequencing adopted by the anthoi both foi the materials as a whole 
and for the units or sections Vfcifhin the materials^ Docs the author, for example, 
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base sequencing on some serial progression from simplicity to complexity, or 
from frequency to infrequency? As an alternative to serial progression of either 
kind, the sequencing may be cyclic or modular. As such it may not necessarily 
depend on any linguistic continuum of simplicity to complexity or from fre- 
quency to infrequency. The sequencing may be carried by some thematic devel- 
opment, or by interrelated activities, or by a developing repertoire of inter- 
* dependent skills. So, might the sequencing take the form of gradual expansion / 
from a core, the learnei s knowledge and abilities being simultaneously refined 
within frameworks which develop from a core of knowledge and abilities? 

3.4 How are the Contents Subdivided in the materials? On what basis has the 
author broken down *he content into units, and into parts and sections of units? 
What are the criteria on which chapters, units, sections, lessons exercises or 
parts of exei rises subdivided? Typically, such subdivision may involve serialized 
structures, 'functions' or 'acts', and may be explicit or implicit Explicitly the 
subdivision may involve episodes in some narrative, or a sequence of situations, 
while implicitly the subdivision may be by sentence structure or function. 

3.5 How is Continuity maintained in the WMterials? How is the learner's progress 
guided from one part of the materials to the other? In what ways is a particular 
aspect of content earlier in the materials reinforced and/or refined in a sub- 
sequent part or parts of t\ie materials? Is this reinforcement/ refinement achieved 
by following some incremental, accumulative pattern of items, or an expansive, 
aggregative development which is retrospective and prospective? Is the con- 
tinuity presented through formal items, or are other potential sources exploited 
such as, for example, the knowledge systems or skill' use? Again, as with focus 
and subdivision, we may need to distinguish between explicit and implicit con- 
tinuity offered to the learner. 

3.6 What direction should the learner follow through the materials? Is the 
direction the user should follow made explicit, or is it once again implicit? 
More precisely, is the learner assumed to follow the direction of the materials 
in some predetermined way from the beginning, through the middle, and to the 
end? Alternatively, is the learnt r (and the teacher) able to take any of a 
variety of directions? Is the direction to be taken open to discussion and choice, 
or is it closed and prescribed, preventing entrance and exit -at any chosen point? 

3.7 Proposals, In the foregoing section we have set out some questions which 
can be asked in evaluating the content of materials. Here we offer a number 
of proposals concerning content which may guide its adaptation and innovation. 

Theory of Language. We have already indicated that language as com- 
munication is characterized by the simultaneous realization of the ideational, 
interpersonal and textual knowledge systems. That, moreover, the social and 
interpersonal character of communication guarantees that it will be permeated 
with personal and socio-cultural values, attitudes and emotions, that, in con- 
sequence, learners of a language arc involved in a process of sharing, negotiat- 
ing and creating conventions against a background of p/evious and ongoing 
psychological <and social experience. This view emphasises the interplay among 
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the knowledge systems and the dynamic relationship between them in particular 
social events. In turn, this assumes that communication involves certain basic 
abilities of interpretation, expression and negotiation which act upon and are 
refined by the knowledge systems. The abilities are themselves realised through 
and refined by the surface skills of reading, writing, listening and speaking. 
Content, if it is to reflect this view of language as communication, should there- 
fore act as a carrier of these related components which underlie communicative 
perfo.mance. 

Focus. It follows from the above that the focus of content within a com- 
municative methodology will be on knowledge which is both cognitively and 
affectively significant to the learner and which demands negotiation in an inter- 
personal context. Content will also focus on cominu nidation as a part of, and 
related to, other aspects of human behaviour. 

Sequence. Accepting that the communicative process requires us to deal 
with variable conventions, we cannot assume that any step-by-step o*>accumula- 
tive sequence of content will necessarily be approp iate. The ordering of con- 
tent will be affected by learners* alternative routes. Sequencing, as a result, 
derives from the state of the I^arner(s) rather than from any implicit logic of 
the content itself. Simplicity and complexity, frequency and infrequency, in any 
absolute sense, cease to have any value in a communicative methodology. As a 
result, sequencing must be a cyclic process where learners continually develop 
related aggregations of knowledge and refining the use of abilities, rather than 
accumulate separable blocks of static knowledge. ^Content becomes something 
to be moved into, out from, and returned to, by learners in a process of finer 
analysis and refined synthesis. Curriculum designers cannot, therefore, predict 
with any certainty the levels of content upon which learners will decide to 
evolve their own sequence in learning. They can only anticipate a richness of 
content which will activate the learners* process competences so that the ulti- 
mate target repertoire(s) become accessible to learners and the particular 
demands of such repertoires become recognized. 

Subdivision, Given the communicative requirement to interrelate the com- 
ponents of the knowledge system — the ideational, the interpersonal and the 
textual — in infraction with the abilities involved in using this knowledge, sub- 
division in a communicative methodology cannot be based on serialized forms 
and functions. It has to be based on activities or tasks to be undertaken, where 
both knowledge a"d abilities are directed towards communicative performance. 
So, we are' not conct ned with units of content, but with units of activity which 
may guarantee this interaction within the knowledge systems, and between 
knowledge and the underlying abilities which work upon and derive from that 
knowledge. 

Continuity. Within a communicative methodology, continuity does not lie 
within an ordrring of formal linguistic items. It can be identified in at least 
four areas. Pirst, there is continuity possible from one activity to another and 
from one task to another. Activities and tasks set up their own requirements for 
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their progressive accomplishment and it is in the pursuit of these requirements 
that the learner finds tangible continuity. Second, continuity lies within com- 
municative performance during the learning and teaching, both in terms of 
communication in lessons which will include sequences of negotiation, and 
within the discourse of language data in terms of coherent sequences of utter- 
ances. Third, continuity is provided through the ideational system. At the 
general level the learner has access to continuity of theme, while at the par- 
ticular level the learner can ha\e access to conceptual or notional continuity. 
Because ideational continuity is realized through a progressive refinement of 
textual knowledge— refining a concept, for example, implies a refinement of its 
linguistic expression and vice-versa — there is parallel continuity of meaning, 
grammar and discourse. Fourth, continuity resides in a ski 11- repertoire or cycle 
of skill-use during an activity or task. For example, the achievement of a par- 
ticular communicative task might require a progression from reading to note- 
taking to speaking. 

A communicatee methodology would exploit these four areas of continuity 
as clusters of potential continuities for the learner, rather than expect one alone 
to be adequate. All of them can be inherent within any single task or activity. 
These alternative types of continuity offer two important advantages foi the 
materials designer. They can serve the full process competences of the learners, 
and they can allow for differentiation. We need to enable learners to discover 
their own continuity, and hence establish the criteria for their own progress. 
In accomplishing some immediate tasks, learners wil! impose their own personal 
and interpersonal order and continuity upon the task, the communication which 
the task generates, the ideational and ♦ i xtual data which they act upon, and 
the skills they need to make use of in the achievement of the task. As a result, 
the progressive refinement of the learner s own process competence provides an 
overall learning continuity. 

Direction. It follows from what has been said about sequence and con- 
tinuity withir. a communicatee methodology that content is not to be followed 
along some predetermined route. Nor is it the case that content exercises some 
external control and direction o\er the teaching-learning procedures. Choosing 
directions becomes part of tlie methodology itself, and involves activities of 
negotiation between learners and learners, learners and teachers, and learners 
and texts. Who or what directs content becomes a justification for communica- 
tion about the selection and organization of content within methodology and 
about the various routes to oe adopted by the learners through any agreed 
content. The direction of content, and the nature of the content itself, can be 
predicted within methodology only inasmuch as it serves the actual lean 'ng 
process of the participants in the group. Carrier content may be as diverse as 
the difierent Directions lea mors may take towards a common target perform- 
ance. 

4. Materials Working 

A;; with content, we can identify a ninnbei of principles upon which to 
O activities, and, as before, there will be different criteria guiding the ap- 
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plication of these principles. We should emphasise that our decisions' earlier 
concerning content evaluation, adaptation and innovation will be closely related 
to decisions concerning activities and tasks. Activities work upon the content, 
and content serves the activities. It is to be expected that there will be symmetry 
between the criteria, and the decisions resulting from those criteria, both for con- 
tent and activities in the materials. We should also take care not to identify the 
activities promoted by materials solely with those devices described as exercises. 
Every part of materials involves or implies an activity or a task on behalf of 
users. Thus, for example, reading a passage before answering questions about 
it is itself a task, as is listening to some audio data before working on them. 
Learners never passively react to data if their attention is directed towards 
them. In this sense there needs to be no real distinction between a piece of 
data and the tasks evolving from it. 

4.1 What is the implicit theory of teaching/ learning within the material? In 
section 1.4 we made a distinction between product materials and process ma- 
terials. We can make use of this distinction in considering the implicit theory 
or view of teaching/ learning which determines the overall methodology, of the 
materials. Those materials which are product-oriented assume that teaching is a 
matter of transmitting knowledge or information directly into the learner. 8 

Such a view is likely to assume that the knowledge itself will take a single 
path so that no real change in the knowledge occurs between the model data 
and the learner's own system. Learning becomes a receptive activity involving 
as little change as possible in the knowledge during its transmission and recep- 
. tion. Learners, in consequence, are all believed to learn the same thing at the 
same time in the same way. 

Process-oriented materials, on the other hand, assume that teaching is a 
matter of cooperative negotiation with learners, involving knowledge and in- 
formation in a process of^oint interpretation and shared expression. Instead of 
transmission there is a concern for communication. Knowledge is never a static 
object "out there" but varies and changes because of its intersubjective nature. 
So, learning not only changes the learners prior knowledge it also involves a 
continual change in the new knowledge. Learners in process-oriented materials 
are consequently assumed to be learning different things at difFerent times and 
in different ways. However, learning need not be seen as entirely subjective. 
Differential learning is itself subject to the sharing and negotiation of knowledge 
typical of an individual learner seeking confirmation that she/he is learning and 
has learned, both through further negotiations with the data, and in negotiation 
with the teacher and/or fellow learners. 

These alte~ stives of product and process are best seen as upints on a 
continuum, representing theories within which there is variation. 0 Nonetheless 

* For an interesting distinction between teaching seen as tuansmission and teaching seen 
as interpretation see Barnes (1976). 

"Process materials can be seen ar standing mid-way on a continuum between product- 
oriented materials and learner-oriented materials. For further discussion of such distinctions 
and such a continuum within a broader language- teaching curriculum framework, see M. P. 
Breen and C. N. Candiin (forthcoming). 
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we can ask which theory of teaching/ learning do the materials assume or imply, 
and thereby discover at least which orientation they favour. 

4.2 What classroom Procedures do the materials assume? Do the materials 
predetermine what occurs in the classroom? What do they leave for the teacher 
and the learners to determine for themselves? Do the activities involved by 
the materials assume a similar or recurrent classroom procedure from lesson to 
lesson or do they assume variation and diversity in procedures? Should teachers 
and learners follow the procedures laid down in the materials or are th%y free 
to choose their own procedures and alter the materials accordingly? 

4.3 What Participation do the materials imply? "Who does what?" in the ma- 
terials? What opportunities are there for individual, small group and whole class 
involvement in activities? Does the involvement vary and if so. for what reason? 
Is the teacher assumed to be a participant or is she/he excluded from such 
participation and accorded some outsider or directing role? 

4.4 What, Roles are given to the teacher? Is the teacher a transmitter of knowl- 
edge, a model to be emulated in much the same way as data within the mater- 
ials? Is the teacher an evaluator who shares the same criteria for success or 
failure as the materials? Is the teacher a guardian of how the learning ought 

-trrtake place and be achieved? In brief, ar<* the teacher's roles assumed parallel 
to those of the materials? Alternatively, is the teacher able to exploit the ma- 

" terials as £ co-participant with the learners, adopting roles different from or 
complementary to those of the materials? 

"4.5 What Roles are given to the Learner? Are learners explicitly or implicitly 
assumed to see the data in materials as models to be imitated, practised, stored 
( n memory and recalled at some later date? On the other hand, are learners 
expiated to explore and solve unanswered problems within the materials, to 
interpret data, hypothesise and evaluate, and be given the opportunity to go 
beyond the information given? Is the learner expected to have little relevant 
knowledge, to be limited in his abilities and skills, or do the materials encourage 
the learner to exploit the relevant knowledge, abilities and skills she/he already 
has? Are tasks to be seen as difficult and of increasing difficulty as the materials 
progress, or is the learner s initial competence engaged from the start? In short, 
where do the materials place the responsibility for learning: with the learner? 
With the teacher? With the materials themselves? 

4.6 Hotv do the materials involve the Learner',-; Process Competence? We refer 
here specifically to the roles expected of the learners by the materials, from 
an examination of the activities and tasks within the materials. What scope do 
they otter for activating learner competence, whether initial, process or target? 10 
Language learning, as in language use. depends on the simultaneous engage- 

10 See section 17 in this paper We may regard these competences as equivalent to 
the distinction made between the learner's mother tongue, interlanguages and target language 
For a more detailed discussion of the components of competence see M. P. Breen & C. N 
Candlin (forthcoming) 
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ment of underlying knowledge and abilities. Nonetheless we can accept that 
seme activities in the teaching and learning may focus on one or other of these 
components of competence. To do so is not to ignore the other components but - 
to refine them simultaneously since, as part of a unity, they are inevitably in- 
volved. In undertaking any specific task, all the components of the learners 
process competence will be activated to some degree or other. Thus, although 
the questions that follow focus on each of these components in turn, it is to be 
expected that materials will involve the learner's process competence as a 
unity «?t any point in th'» learning. t 

How and to what extent do the materials involve I' o learners knowledge 
systems? What ideational, interpersonal and textual knowledge must the lea/ner 
exploit, activate and develop in working with the materials f What demands are 
made on ideas and concepts, knowledge of interpersonal behaviours, knowledge 
of language form? How are the learners affects aroused? Is it an aim of the 
materials to engage the attitudes, values and emotional responses of the learner? 
Is she/he encouraged to evaluate and judge against a range of personal, inter- 
personal or social criteria? How and to what extent do the materials call upon 
the learners abilities? In what ways do the materials activate in teip rotation 
and expression? Is negotiation within and among learners, and between learners 
and teacher, promoted and encouraged? How are learners required to deal with 
tasks r" Are the abilities seen as interrelated or are they treated in isolation?, 
Are they seen as underlying the surface skills or directly equivalent to skills? 
Which sk' * an* exploited in the materials? Are listening, speaking, reading 
and writing to be used separately 01 is their potential overlap exploited? Are 
they treated in some externally 1 ioti\ated sequence, or exploited as a cycle 01 
repertoire within some specific taslv thus cap'tali/ing on the developmental 
interrelationship between them? 

4.t Proposals. Whet enter govern activities within communicative language 
'earning materials? The following addresses this question by emphasising learner 
actions with and through materials 

Negotiating. Activities and , sks in materials should encourage negotiation: 
both personal or psychological negotiation between individual learners and 
texts or tasks, and the interpersonal or social negotiation bei\u en all partici- 
pants in the teaching-learning process — between learners and teacher, and 
learners and learners. From a psychological viewpoint, negotiation is t\\t process 
of interaction between that which is known in ♦ 4 e learners mind and that 
new knowledge which she he confronts, From such negotiation both the oiiginal 
knowledge-* and the new knowledge are transformed and a synthesis accommo- 
dated in the learner's mind. From an interpersonal point of view, negotiation 
is a process o\ seeking and creating eorsensus between the ideas, aflecis and 
social identity of one participaujt and those of others within the co^Vmuiica- 
Hon Throughout such negotiation, and as t i lesult of it, ideas can be shared, 
affects recognised and social identities acknowledged Oik important outcome 
of the process of negotiation is that the conventions upon which the negotiation 
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will be initiated are themselves open to negotiation and recreation. 11 We can 
see, therefore, that both communication and 'earning— regardless of the knowl- 
edge to be learned—is a matter of negotiation. 

Communicating and Metacommunicating. Activities and tasks within ma- 
terials, as we have argued, .should promote communicative performance. But 
such communicative performance should draw upon and extend learner's process 
competences, thereby engaging the developing knowledge systems and abilities 
underlying performance through the use of the learners' skills. Both teachers 
and materials need to distinguish between any target communicative perform- 
ance and the learning of communication. Communicative performance in the 
classroom will be made up of both communicative and metacommunieative acts. 
The latter are those acts where learners jointly interpret, express and negotiate 
about learning and about communication. They can be of equal significance in 
the learners development of process competence as the use and refinement' of 
that competence through communicative acts. In consequence, materials need 
to provide for metacommunication as well as for communication. 

Authenticating. In order to be authenticated by individual learners anc 
groups of learners according to their own relevance criteria and their own 
developing process competences, activities and tasks within materials need to 
be differentiated, (see sections 2.2 and 2.3). Activities and tasks need to offer 
a 1 ■ -natives for who i> to undertake what, when and how. They should be 
sufficiently diverse and rich to engage different learner's knowledge and affects, 
and abilities and skills, in interrelated and supporJve ways. If they are called 
upon within the group, these differential contributions of the individual learners 
can he of benefit to a!. : tin members of the group Such differential contributions 
to -the^ group ha\e dual potential, they can provide the basis for authentic nego- 
tiation and they can allow learners to take on a shared responsibility for the 
Nte^adiing-learning Instead of presenting a problem, therefore, to the teacher, 
the diverse contributions of learners can be taken advantage of as sources for 
shared and agreed directions in the learning. 

Co-participating Activities and tasks in materials should involve teacher 
and learneis as co-participants in the teaching-learning process. Teachers and 
learners haw\ of course, their own specific contributions to make towards this 
process, but teachers and Kirn.'rs may be given equal opportunity to negotiate, 
evaluate and provide feed!* and to offer new directions for the process 
itself. Activities and tdsh lerefore he less concerned with providing 

answers than with p )sing j: i,i,„,.,s for joint interpretation and with encouraging 
the sharing of egression as springboards for other activities and tasks beyond 
the materials Simply, materials should be* a provider of potential for activities 
and tasks as well as incorporating their own. Materials can, as a result, par- 
ticipate as a creati\e resource within a network of resources — both human and 
material. 

11 For interesting discussion of ps> chologtcal and tntei personal negotiation see, for ex- 
ample, Neisser (1976) and Goffman (1972), 
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5. What New Directions Are Available in Materials Design? 

4.1 Language Learning rather than Language alone and Process rather than 
Content In the preceding sections we have emphasised communication as the 
overall purpose of the language teaching curriculum. Communication we have 
defined as a process of relating language forms and language behaviour in the 
context of social events. We have stressed that the conventions which link 
forms to behaviour are not fixed for all time, nor even certain among different 
participants in an event or across events but are variable and need to be 
constantly negotiated and accepted. Communication becomes a convention- 
creating rather than merely a convention-following activity, it is a social and 
interpersonal process. Learning to communicate is, as a result, not a matter of 
digesting a static and predictable body of knowledge, but learning how to 
interpret, express, and negotiate these conventions. As participants in this 
process, learners naturally start from their own communicative knowledge and 
abilities. Learning, for language learners, is the gradual development of com- 
municative knowledge and abilities through the process of communication. To 
be authentic to this definition of the learners task, materials will have a primary 
concern for the process of communication. Content in materials will, as a result, 
be redefined as*any information and data which serves communication. Further, 
since learners will impose upon such information and data different interpreta- 
tions, and will bring to them different contributions, authenticity to some par-^ 
ticular target repertoire become§ a secondary concern in materials design. The 
activation of r#6cess competence in the learner, using whatever content as a 
carrier of this process competence— possibly including selections from or aspects 
of some target repertoire— will take priority in materials design decisions. 
5.2 Activities rather than Models and Differentiation rather than Predictability. 
fn fulfilling *his concern with process competence, materials take on a new 
relationship with the learners and the teacher. Rather than directing the teach- 
ing and learning, they act as an interdependent resource between the learner 
and other learners and the teacher, and among all other available material re- 
sources. It becomes no longer necessary for particular materials to cover every- 
thing in the teaching and learning. They become an action-resource, serving as 
a means for sensitizing learners to the actual task of learning to communicate, 
a means for the observation and analysis of communication, and a means for 
the development of the knowledge and abilities which the learner contributes. 
As such, materials will be made up of units of activity with the overall objec- 
tive of showing learners how to learn. Units of activity will seek to engage 
learner abilities— and their realisation through learner skills— and set these to 
work upon the themes and concepts of ideational knowledge in the context of 
interpersonal behaviour. To capture and maintain involvement of this process 
competence, activities are likely to be of high affective content, continually 
challenging the learners attitudes, \alues and emotions I:* materials as units 
of activity, texts, of whatever medium and type, become the servants and means 
of process competence and cease to be ends in themselves. 
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Units of activity within materials will be Char izcd as a wide variety of 
different problems and will incorporate tasks of dtff ient kinds. There will be 
no explicit continuity and sequence between units of activity; activities and 
tasks will be heterogeneous rather than homogeneous. Each activity will serve 
as a stimulus representing an initial framework for the learner's prerequisite 
knowledge and abilities which need to be brought to bear on some particular 
task or tasks. Materials as constellations of activities need no beginnings, mid- 
dles, and ends in the usual sense. Of course, any activity will have a framework 
from which it plausibly begins, and it will lead to appropriate, through various, 
outcome or products. However, if negotiation characterizes the teaching/ learn- 
ing process, and it different learner contributions are exploited within the 
activity, there may b*» no single predictable route through an activity nor 
through the materials themselves. This differentiation of routes in the learning 
can allow for the varied -complishment of learner's individual and shared 
objectives. 

.5.3 Materials can pose Problems. Materials as an action-resource will be char- 
acterized by problem-solving. They will need to offer uncertainty, unanswered 
questions, and challenges to the curiosity of learners, rather than the accumula- 
tion of prescribed and unquestionable Jtems of knowledge. In essence, the 
activities will involve the learners in thp process of restoring equilibrium be- 
tween their initial knowledge and abilities and that knowledge and those abil- 
ities incorporated within anc 1 temanded by the activity. The materials will be 
necessarily incomplete, they wii* encourage the learner to search beyond the 
information and data given. Activties as problem-solving would encourage the 
learner to seek help through the teacher's and other learners' interpretations and 
expressions, through other material resources in the classroom (dictionaries, 
grammars, and other manuals, for example), and through relating what is re- 
quired of her/him in the real world beyond the classroom and to the appropriate 
available resources existing there. In this way, the activities within materials will 
serve as both starting-point and meeting-point for connections between' the 
information and data within the materials and the learner's experienced world. 
Like the classroom itself, materials can function both as an observatory of 
human communicative behaviour and a laboratory for the examination and 
analysis of such behaviour. 

Designing problem -solving activities for materials is, of course, a problem- 
solving activity in itself. If we adopt units of activity in materials design, what 
kinds of design decisions : re likely to confront us? Figure 1 generally illustrates 
the design process we mi' ht undertake in planning for an activity. Readers 
may perceive this process as exactly parallel to an activity they wonld actual!) 
piomote in the classroom. The authors would suggest that the particular prob- 
lem of activities design — and, therefore, materials design — could well be an 
appropriate problem-solving activity to be presented to learners during the 
teaching- learning process. 

5.4 Materials design can begin wi h Learner Negotiation. In considering new 
9 
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Figure 1. Decis ions in Designing Activities With in Materials 

Step 1 — Input 

1.1 What implicit objectives/solutions is the activity to promote? 
Is there potential for different objectives/solutions? 

1.2 What prerequisite learner contributions will be Jrawn upon? 

1.3 What information/data might be appropriate to the activity? 
la Content terms? (Section 3 of this article) 

In Process terms? (Section 4 of this article) 

Step 2— Formulation 

2.1 What is the initial framework for the activity which relates each element of the 
Input — potential objectives/solutions, learner prerequisites, and appropriate informa- 
tion/data? 

2.2 What is the nature of the problem(s) posed? Can learner recognize the problem 
and formulate or reformulate it? 

Step 3— Search/Tasks 

3.1 Are. individual learner interpretations of the problem — and the information/data 
supplied — allowed for? Are individual learner solutions promoted? 

3.2 How might the group or sub-groups undertake negotiation about individual learner 
interpretations as a means towards some group solution? 

3.3 Might individual meirbers of the group act as observers and evaluators—on the 
basis of the group's criteria perhaps — of (a) the solutions proposed and (b) the 
communicative process within the group? (i.e. individuals might act as task-* 
evaluators and communication-evaluators ) . 

Step 4— What Potential Outcomes? What Further Tasks? 

Either 4.1 (i) Sub-group expression of solutions proposed and sharing with other sub-groups. 

(ii) Evaluative negotiation between groups as to the relative appropriateness of the 

various solutions — involving some agreed criteria. 
Or 4.2(i) Group evaluative negotiation concerning the relative success/failure in (a) 

the overall activity/problem, (b) specific tasks, or (c) communication. 

(ii) Group/sub-group design on an instructional plan for either (a) solving the 
type of problem(s) dealt with in this activity, or (b) developing the specific 
communicative knowledge and abilities required by this activity which the 
#pup or some of its members felt they had not sufficiently developed. 

(iii) Peer teaching on the basis of such an instructional plan. 

(iv) Recycle and Transfer in terms of learner-designed activities/Droblems which 
build upon and are generated by this particular activity/problem. 

directions for materials design, we have so far suggested that materials can 
incorporate a view of language as communication; can directly reflect learning 
. as a communicative process, and can contribute to this process' through che 
provision of problem-posing activities. Each of these new directions imply that 
materials design may have, as its initial concern, the ways in which learners may 
interact with and through the materials. So, materials design could begin with 
the question: How might learners be encouraged to negotiate with and through 
the materials? Learner . negotiation— as w,e have indicated in section 4.7— can 
be both a personal and interpersonal undertaking. Such negotiation inter- 
relates with and builds upon the other communicative abilities of interpretation 
and expression. Accepting this interrelationship between communicative abili- 
ties, the materials designer can seek to provide answers to two further questions: 
2) How might learners interpret ihe materials and develop interpretation 
through the materials? and 3) How might "learners be encouraged to express * 
and develop expression through the materials? Each of these questions present 
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what is, perhaps, a new challenge to materials designers and teachers. We are 
obliged to uncover the complexities of the abilities of negotiation, interpreta- 
tion, and expression in order to activate them in direct ways during the teach- 
ing-learning process. 

We have previously suggested (section 1.4) that the four skills serve to 
realize these abilities and can act as a means to their refinement However, the 
abilities underlie and act through all or any single skills, so the latter are the 
observable representations of the abilities in action. Studies of reading and 
writing, and speaking and listening, have uncovered a whole range of sub-skills 
appropriate to each main skill and which combine together in its use. It is very 
likely that each of the abilities of negotiation, interpretation, and expression 
are each similarly made up of appropriate combinations of sub-abilities. The 
sab-abilities of interpretation, for example, may be a cluster or complex of sub- 
abilities which continually interrelate with one another in order to help an$ 
refine actual interpretation. So, interpretation itself may be seen to be made up 
of the following kinds of sub-abilities; 1) Attention and initial discrimination; 
2) Making sense (through global or general assimilation to particular analysis) 
and 3) Going beyond the given information /data (involving accommodation, 
inferencing, hypothesizing alternatives, evaluating and judging, and remember- 
ing). 

We may similarly define the ability of expression in terms of sub-abilities 
of the following kinds: 1) Producing or representing (involving recall and 
recreation and the proposal of new meanings in new ways); and 2) Transferring 
or generalising (involving relating different information or ideas or relating 
the known to the un known etc., transferring learning to other activities and tasks 
eta, applying; new knowledge and refined abilities and skills, and evaluating 
the outcomes f expression or the feedback it produces). These are air possible 
examples of a number of sub-abilities which may combine and interrelate in 
the' learners efforts to interpret and/or express. 12 

In seeking to develop one or other of the learner's main communicative 
abilities, materials may activate directly one particular suo-ability or a related 
combination of sub-abilities. The recognition of the various sub-abilities within 
the main communicative abilities could provide the materials designgr with both 
a richer and more* precise focus of attention and a wider range of potential 
activities and tasks. One new direction available to materials designers, there- 
fore, would be to begin from just these abilities and their sub-abilities* Since 
process-oriented materials emphasise the development of learner abilities — and 
their sub-abilities — we need to discover more precisely what they are. This dis- 
covery need not be separate from teaching and learning. It will emerge as a 
significant outcome from the teaching-learning process. In this way, materials 
design can be one of the ongoing products of classroom activities. This uncover- 

13 Limitations of space have prohibited us from discussing and examining these sub- 
abilities in any detail. For a closer consideration of them and their place in matt rials design 
*ee in particular Part 2 of M. P. Breen & C N. Candlin (forthcoming) and other sources re- 
ferred to there. 

ERIC 

114 • 



108 



The Learner in Focus 



big of abilities and this cycle of materials design, we have argued in this article, 
may most effectively emerge through a communicative methodology. 
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Frequency: A Criterion for Syllabus Development 

, . Joni Turano-P«rkint 

Southern Illinois University 

In sum, ten grammatical structures were arbitrarily chosen for this study. Nat- 
ural language data were gathered by taping a television talk show. Frequency 
counts were established for the ten structures by counting the number of times 
they were used. The frequency counts served the basis for a rank order of 
the ten structures. The frequency rank order was compared with the presenta- 
tion order in three ESL grammar texts. There was no significant overlap be- 
tween frequency and the orders of presentation found in the grammar texts. 

1. Introduction 

The EFL/E&L profession has experienced a profileration of teaching ma- 
terials in the last fivp or six years. So enormous has been the volume of pro- 
duced, works that both experienced and inexperienced teachers are often 
dazzled by all of the paraphernalia which "guarantees" production of a student 
who is fluent in English. 

But alas! How "expensively" disappointing and frustrating all of your 
efforts and time become when your best student responds to your question, 
"Where did you live in Iran?" with a perfectly grammatical, "two weeks ago." 
The WH-Question structure was taught only the day beforel 

Your immediate reaction leads you to re-examine the materials with which 
you have been working. A comparison of texts reveals similar exercises and 
explanations of usage and structure, but a different ordering of the grammatical 
structures in each set of materials. How did the authors arrive at their particular 
sequencing of structures? What criteria were used to order and even to select 
those particular structures? 

2. Background 

Such questions and observations led Ferguson (1966) to believe that many 
syllabi have been based upon impressionistic judgments and vaguely conceived 
theoretical principles. Charles and Agnes Fries (1961: 7) concur that "most 
of the sequences of the structured items, not only in the syllabus, but even in 
the textbooks are quite arbitrary^ with little or no justification/' 

Various morpheme acquisition studies dealing with frequency have led 
Dulay and Burt (1977: 118-119) to suggest that \ 

Frequency serves primarily to increase the probability that those structures which 
the learner is ready to process will occur, thus increasing the chances that the learner 
will be able to attend to and process them. If this is the case, one might expect that 
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the rate at which acquisition takes place may U» increased if maximal exposure of 

* a precise .sort at specific times over an appropriate period of time is provided 

It has also been pointed out by Larsen-Freeman (1974: 158) that 

. . . frequency of occurrence is some measure of the usefulness of a struct' ire and 
, furthermore, it is quantifiable and thus not left to the intuition of the individual 
author or t teacher. In general, items that are in frequent use need to be taught be- 
fore those that are more rare whether we are talking about formal it<'ms or gram- 
matical categories. 

In his book entitled, The Threshold Level for Modern Language Learning 
in Schools, J. A. van Ek (1977: 163) concludes that "we do not need to acquire 
grammatical patterns unless we intend to put them to immediate use. This 
may well be the criterion for selecting grammatical items." 

Results in a number of experimental studies with vocabulary (Higa 1965: 
172) determined that 

. . , familiarity is a significant variable in recognition, learning, and recall. The fa- 
miliarity value of a word is measured by the frequency of its usage which is found 

♦ in word counts such as The Teachers Word Book of 30,000 Words by Thorndike 
and Lorge. 

As Larsen-Freeman. (1974: 159) appropriately asked, "If vocabulary lessons 
have been based on frequency occurrence, why shouldn't syntactic structures 
as well?" \ 

In this paper, I will argue that*frequency counts of grammatical construc- 
tions in natural language use should play an important role in determining the *" 
order of presentation of grammatical constructions in ESL grammar texts. ' 

b. Methods and Materials 

The questions to be addressed are: 

1. What structures are more frequently use^ than others? 

2. How does a rank ordeiing of grammatical structures compare with the 
orders of presentation in ESL grammar texts? 

For purposes of this study, three popular ESL textbooks were used: English 
Sentence Structure by Robert Krohn, Using English Your Second Language by 
Dorothy Danielson and Rebecca Hayden, and Volumes One and Two of Modern 
English: Exercises for Non-native Speakers by Marcella Frank. 

The justification tor using these three textbooks in this study is that they 
are used in many different intensive language centers to present English sen- 
tence structure. Since the three textbooks were written with different purposes 
in mind, it would be unfair to the authors to expect them to present gram- 
matical constructions in the same order. 

To answer the aforementioned questions, I toped four hours of the Phil 
Donahue Show, which is a popular daytime television talk show featuring both 
well known and unknown people. The guests are informally interviewed by 
both Phil Donahue and the audience* Although I do not claim that the language 

ERIC il? 



Frequency 



111 



sample obtained from these interviews is the epitome of natural language, this 
particular program was sampled for the following reason*: 

1. A study such as this requires a large amount of data to make accurate 
inferences to general speech norms. It is extremely impractical to amass 
such large quantities of data on the street. 

2. Due to whit Labov (1972: 2<*9), has termed the 'observer's paradox/* 
substantial quantities of truly natural speech are difficult to obtain 
under any circumstance. 

3. Television talk shows, such as Phil Donahue do not rse prepared scripts. 
Although a particular topic may have been selected in advance for dis- 
cussion, the questions, responses, and comments are spontaneous. 

4. The speech used is accompairied by much extralinguistic meaning, 
paralanguage, kinesics, etc. which augment natural language. 

5. Previous research has determined that "conversation pr grams can give 
us a good cross-section of a population" (Labov 1972. 211). 

Frequency counts were calculated for the following ten arbitrarily chosen 
grammatical structures: 

1. 'to be' as a notional verb in simple present and simple -past forms 

2. present perfect 

3. present progressive 

4. yes 'no question 

5. passive 

6. relative clause 

7. noun clause (including wh-clauses) 

8. modal 

9. tag question 

10. possessive marked by the inflectional 's on the noun 

It should be noted that natural language use from a different source (i.e. 
a lecture or a soap opera) might have produced a different rank order *ased on 
frequency counts. For exarilple. one would expect an interview show to contain 
many questions. Yet. the yes no question did not hav? a high frequency of 
occurrence. If the data had been collected from a lecture or a soap opera, yes/ 
no questions might ha\e had a lower rank. 

In order to avoid possible confusion, the structures counted adhered to the 
definitions found in the three textbooks being examined. For example; some 
people linguistically recogni/e the utterances, "right" and "huh", when marked 
witli a rising intonation pattern at the end of a sentence, to be a type of ta;. 
Because none of the texts deal with this phenomenon, tag questions wfre 
counted according to the standard textbook definition. 

Interviews for the four particular shows included columnist and author, 
Ann Landers; comedian and actor. Steve Martin; politician, Edward Kennedy; 
and psychologists Werner Erhart, Dr. Nathaniel Branden, and Dr. WUl Schutz. 
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It should be noted that all speech (i.e. guest, audilnce, and Donahue) was 
analyzed. 

j 

il. Results 1 

Table 1 contains a rdnking from the highest frequency to the lowest fre- 
quency of the :en structures for each individual interview. It also lists a col- 
lapsed order of structures which will be called the final rank order of the four 
interviews. 



TABLE 1 

Correlation of Individual Interviews with Final Rank Order 



— ■ ..J— ... 

Final rank order 


Landers 


Martin 


Kennedy 


Psychology group 


to be 


1 


1 


1 


1 


noun clause 


3 


2 


3 


2 


modal 


2 


3 


2 


3 


relative clause 


4 


5 


4 


4 


present progressive 


6 


e 


5 


6 


yes/no question 


5 


4 


8 


5 


present perfect 


7 


7 


7 


7 


passive 


8 


8 


♦ 6 


8 


possessive ( s) 


9 


9 


9 


9 


tag question 


ro 


10 


10 


10 




rho =.98* 


rho = .96* 


rho = 94 # 


rho = .99 # 



* P < .01 



The data in Table 1 indicate that there is considerable agreement among 
the ranks. Each interview rank order when correlated with the final rank order, 
produced a rho which was significant at the .01 level Kennedy's rank order, 
more than any other single rank order, differed with the combined rank order: 
the rho between Kennedy and the combined rank order was .94 nonetheless. 

To determine the degree of agreement or association among the ranks of 
the columns of Table 1, Kendall's Coefficient of Concordance was calculated 
and is referred to as Kendall's W. Kendall's W expresses the average agree- 
ment, on a scale from .00 to 1.00, among the ranks. For the four ranks in Table 1 
the Kendall's W is .95. We can state, therefore, that the relation between the 
four sets of ranks is substantial. 

Such a high degree of correlation reinforces the hypothesis that the four 
rankings demonstrate a high degree of similarity that would not occur by 
chance alone. The highest correlation obtained was between the psychologists 
and the final rank order. The strongest degree of correlation occurs with the - 
group having the most interaction thus lending support to the final rank of 
structures. 

The textbook order was based on the first formal presentation of the struc- 
ture even. though the structure may have been used throughout the entire text 
and/or divided between chapters in its presentation. To be as a notional verb 
is not formally presented by Marcella Frank in Modern English. However, using 
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ia number ten rank for this structure cannot be justified since the verb 'to be* \ 
|s used throughout the entire first volume. As you can see, it was issued a rank 
Order of two since the use of this structure is stressed (although not taught) in 
^ section of noun agreement with verbs. 

j Table 2 lists the Spearman rank order correlation for each textbook order 
with the : final rank order. Krohn's textbook order approaches significance but 
dbes not achieve it. In order for there to be a significant correlation, the rho 
Would have to equal or exceed .683. Neither Franks nor Danielson and Haydens 
textbooks shows a significant correlation. 

i * 

TABLE 2 

Correlation ot Textl>ooks with Final Rank Order 



Final rank orde r Krohn Frank Danielson /Hayden 



to be 1 2 1 

noun clause 6 10 9 

, modal i 4 8 7 

/ relative clause 7 9 10 

present progressive 3 3 5 

yet /no question 2 8 2 

present perfect 8 4 6 

passive 9 5 4 

possessive ( s) 5.1 8 

tag ques tion 10 7 3 

rho -.61 rho = -.37 rho ==-.13 



Interview 

Landers 
Martin 
Kennedy 
Group 



TABLE 3 

Correlation of Individual Interviews with Textbooks 



Krohn 

rho =".65" 
rho = .66 
rho = .47 
rho = .62 



Frank 

rho^~-.28 
rho = -.27 
rho = —.22 
rho - - 3 



Danielson/Hayden 

" rW= -.05" 
rlio — .02 
rho = -.14 
rho = -.08 



TABLE 4 

* Structure Frequency and Percent of Total 



Structure 
to be 

noun clause 
modal 

relative clause 
present progressive 
yes/no question 
present perfect 
passive 

possessive t's) 
tag question 

**T©Utf num^r of COMPLETE SENTENCES UTTERANCES is 3895 
Complete Sentence* = 2183 or 56% Utterances = 1712 or 44% 



Frequency 


% of total 


1393 


36 


714 


18 


671 


17 


365 


9 


313 


8 


279 


7 


173 


4 


144 


3.6 


26 


.6 


8 


.2 




120 



114 



The Learner in Focus 



A Spearman rank order correlation of each interview with each textbook 
yields a nearly identical correlation with that of the final rank order and in- 
dividual textbooks. 

It was not the mtent of this paper to compare textbook presentations with 
actual structure usage in natural speech; however, I would like to note a few 
very obvious discrepancies between actual usage and textbook presentation. 

Forty-four percent of all the discourse has been labeled as utterances 
(partial o. incomplete sentences). Yet, none of the three texts contains one 
partial or incomplete sentence. 

As mentioned before, the verb to be as a notional verb had the highest 
frequency of occurrence, yet, Frank does not formally teach this structure. 

Thirty-seven percent of the yes/no questions we** formed by a rising in- 
tonation at the end of a sentence (i.e., John is sick?). Such a figure represents 
over one-third of yes/no questions; yet, none of the three texts teaches this 
very basic question pattern. Frank (1972: 76) does include a short description 
of the Informal Omission of Auxiliary as in the example, Need any money? 
which has been derived from, Do you need any money? By ignoring thirty-seven 
percent of the responses, all three textbooks are doing a great disservice to their 
users. « 

The tag question app Mkd only eight times out of 3,895 complete sentences 
and utterances. This represents only .2 percent; however, an examination of 
the texts reveals that the three authors consider this structure to be a fairly 
important one. Danielson and Hayderi have devoted their third chapter of 
approximately six pages to this structure while Frank and Krohn spend from 
two to three pages in formal presentation of tags. Had I included a type of 
tag by which a hearer confirms his statement with a rising intonation on the 
tag (i.e., You're going, r right \ ?), the frequency count for tags would have 
ihuh! 

been much higher. As previously stated, this latter use of a tag was not recog- 
nized by any of the three texts. 

Table 5 lists the three categories of noun clauses used in the interviews. 
The highest frequency occurs with the omission of that (52.3?), All three texts 
teach the optional use of that by placing it in parenthesis, none appeer to 
emphasize the use of a noun clause without that. 

Such a hrief overview of usage is in no way intended to substitute for a 
more detailed and thorough examination of structure usage. 

Discussion atid Implications 

I am aware that pedagogical implications are only assumptions at this 
point because there have never been any empirically based studies done to 
determine the effects of frequency structuring on the language learner. 

Wilkens (1974: 13) points out that 

, . . since natural language environments do not involve careful limitation and con- 
trol of linguistic forms and functions, adequate opportunity must be provided for 
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TABLE 5 

Structure Frequency and Percent of Structure Total 



Structure 



s? requency 



% of structure total 



To be 

a. Simple Present 

b. Simple Past 
Noun clause 

a. That Omitted 

b. T/wrt Expressed 

c. Wh-Clausc 
Mod ah 

a. Can 
Could 
Would 
Will 

Have To 
Should 
May 
Must 
Might 

Ought to, Got to, Lets, 
Seed to, Used to y Shall 
Yes/no questions 

a. Inversion 

b. Intonation 



b. 
c. 
d. 
e. 
f. 
g- 
h. 
i. 



1232 
163 

374 
164 

176 

164 
28 
173 
114 
89 
38 
19 
11 
10 



177 
102 



88.3 
11.7 

52.4 

23 

24.6 

28.6 

25.8 

17 

13.3 
5.7 
2.8 
1.6 
1.5 
3.7 



63 
37 



the learner both to experience the natural variety of language in use and to respond 
to the varied demands of normal language needs. 

If One brings this idea into the classroom, and more specifically, to the 
textbooks one is using, 

. . . there remains a feeling that it is the real language which should be taught. This 
feeling appears to underlie adverse criticism of the English used in course- books 
on the grounds that English [American] people do not talk like that (Lee 1977:216). 

Yet, *our knowledge of language in communication is too patchy for a thorough 
coverage of what would be needed" ( Wilkens 1974: 11). Without large amounts 
of empirical data to establish criteria for materials design, selection, and usage, 
we will continue to grope in the dark, and to spend millions of dollars on in- 
effective materials. 

Certain frequency is not the only criterion; furthermore, we must distin- 
guish the two effects of frequency structuring. According to Johnson (1977: 7-8), 

1. Infrequency of occurrence with reference to the student language needs 
may be a criterion for omitting a structure from a teaching programme, but 
cmce the decision to include a structure has been made, then supposed 
difficulty of acquisition seems more a valid criterion for deciding how much 
time should be spent on it. 

2. Secondly, by Using a graded grammatical progression the structural syllabus 
finds itself spending time on less frequent structures which delay the intro- 
duction of communicatively more important ones. 
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Until now, structural grading has been the most effective means of teach- 
ing language to the learner. This reasoning is valid, however, the criteria upon 
which we base this grading are faulty. Johnson (1977: 8) states that 

. . . justifications for a particular ordering which 'run 'we've taught the present, so 
now we'll move on to the pas..' 01 we've done the indicative so now we'll do the 
imperative' are *x)t justifications at all. They smack of a taxonomic linguistic descrip- 
tion . iade without regard for communicative needs 

It would be very presumptuous to assume that the authors of any text, 
including the authors of the three textbooks studied today, did not give a great 
deal of thought to the ordering of their structures^- Consequently, a clarification 
of the authors' reasons for Ordering the presentation of the aforementioned 
syntactic structures would be 1 immensely helpful. 

This leads us to the last question: What practical application might this 
study have in the classroom? For [\ -poses of brevity, I have listed the follow-, 
ing: 

1. Use of more natural language in the classroom by the teacher. Teachers 
should not monitor their own classroom speech in order to give the student 
a chance to hear * uncensored" language both in terms of grammatical errors 
and incomplete sentences. 
. 2. Use of textbooks that have self-contained units as the three we examined ' 
loday, This would allow one to use these books in a> y order. 

3. Do not demand near native perfection from the student. W$ must realize 



. . . using a language may he considered to he a performing skill, as with any other 
skill performing ability will vary greatly from individual to ir * vidua! H may be a 
waste of time to demand near native perfection from performers who will never 
be able to provide it. Our ami should be to ensure that the misuse of language is 
not so serious as tbseure communicative competence (van Kk 1977.164).* 

4. Encourage students to read, listen to the radio, watch television, and take 
part. Since we cannot teach them everything, we must be content to give 
them the basics for survival and the ability to understand in general terms 
what they hear and see. They will learn to teach themselves. 

In summary, my study fits into a schema for education in the following 
way. Proficiency considerations are of utmost importance in setting an in- 
structional program. We ask ourselves how much of the target language has 
to be taught, what arrangement and order of presentation of material must be 
made and how does that ord'-r and presentation of grammatical structures, in 
this case, affect a student's '.earning them and control over them? And finally 
how do we know that the student controls these structures? These aie questions 
which go beyond ihe scope of this paper. However, the questions addressed in 
this paper are crucial to the curriculum design of an ESL program which con- 
cerns itself with the attained language proficiency yf its clientele. 
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, In the wake of the theoretical debate about the merits and demerits of notional 
syllabuses, the few accounts of materia' development projects to appear re- 
cently can provide some timely insights into the feasibility of the proposals 
that have been made. This paper describes the thinking involved in an ongoing 
project aimed at developing a format for working on variation in language use 
in spoken English classes for intermediate-advanced EFL students at the col- 

■ lege level. There is first a discussion of the difficulties involved in working with 
a strictly notional syllabus schema, then a detailed description ot the type of 
syllabus that was eventually adopted: an "encounter syllabus" with an ethno- 
graphic dimension. The units of the syllabus are centered around a series of 
social encounters or speech events which form part of an overall narrative se- 
quence. The teaching-learning activities built around these encounters are de- 
signed to reflect the interactive aspect of discourse and to involve students in 
an inform^ exercise in the ethnography of speaking, both inside and outside 
the classroom. This article describes these activities in some detail, illustrating 
the way in which they effect student and teacher roles r * 

1, Background 

A great de«il has been discussed about the whys and wherefores of develop- 
ing language syllabuses which focus on communicative functions and language 
use. It : .s now generally agreed that there has been an overly narrow emphasis 
in language teaching materials on syntax, morphology and the referential func- 
tions of language. Innovations in course design, particularly at the intermediate 
and advanced level, are long overdue. Some interesting proposals have been 
made, Wilkins' proposal for a notional syllabus (Wilkins 1976) in particular 
has caused the most stir among teachers and course designers. Up to now, how- 
ever, therr have been relatively few accounts in the applied linguistics Jitera- 
ture of the insights gi.ined by those who have actually undertaken the task of 
designing syllabuses of this kind. From the point of view of both theory and 
practice, it is essential that accounts of the thinking that has gone into such 
materials development projects be made available, since it is only in a com- 
pjyativ light that the implications of the theoretical proposals for the actual 
design and, format of language courses can be fully assessed. 

The following »s an account of some of the thinking that has gone into 
an ongoing materials development project at Stanford Univeisity, a project 
undertaken to provide a new kind of format for working on variation in spoken 
discourse with EFL students at the intermediate and advanced level. Given the 
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current state of the art of syllabus design, there are only a limited number of 
alternatives open to the syllabus designer. I e\entually opted for an "encounter 
syllabus" along. the lines suggested by Candlin (1977), with a notional dimen- 
sion incorporated into each unit. After a brief description of the syllabus design 
I will demonstrate how an ethnographic approach falls very naturally out of 
this kind of metacommunicative framework. As I developed this approach, both 
in designing the syllabus content and in working out teaching/ learning activities 
in the classroom context, I found that it provided a particularly dynamic means 
of linking the spoken discourse and language activities in the classroom with 
the speech communities immediately beyond the classroom. The real advantage 
of an- ethnographic approach is the opportunity it affords for engaging students 
in an active learning process, for involving them in informal but systematic 
exercises in the ethnography of speaking outside the classroom: making ob- 
servations, documenting and comparing their observations and conducting sur- 
veys of language use. In this paper, I will describe in some detail the kind of 
teaching/learning activities that I incorporated into the syllabus. These are but 
a (ew examples of the kinds of activities that could go along with an ethno- 
graphic approach. I am convinced that there is still much more that language 
teachers and syllabus designers can learn from the work that ethnogranhers do. 
As Hymes has pointed out, the strength of the ethnographic app**oacn is that: 

It is continuous with ordinary life: much of what we seek to find out in ethnography 
is knowledge that others already have Our ability to learn ethnograplu?ally is an 
extension of what every human being must do. that is learn the meanings, norms, 
patterns of a way of life. (Hymes 1968 18 N 

2. Objectives 

I had several objectives in mind »n developing the materials. 

2.2 Linguistic content of syllabus: To focus on language use by A) dealing 
with the communicative functions of language, with the way people get things 
done with words in day to day language transactions, B) by coming up with 
a syllabus design which would reflect, as far as possible, the interactive aspect 
of spoken discourse; and C) by getting away from tne conventional pedagogical 
strategy of treating language as a homogeneous system: focusing instead on the 
stylistic sanation associated with differing degrees of formality. 
2.2 Teaching learning activities: To focus on learning bv doing: with listening 
and speaking as complementary activities by A) provid tudents with ample 
practice in interpretation of spoken discourse instead . r giving them ready- 
made explanations, and B) by involving students in task-centered communica- 
tive activities inside and outside the classroo, —to get them to use the language 
to learn more about it. 

3. Selecting a Syllabus Schema 

3.1 A notional syllabus? VVilkins has proposed that language syllabuses should 
be built around "general aspects of meaning and use" (WilHns 1976:21) rather 
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than around a set of grammatical structures or situations. He conceives of what 
he has called a "notional syllabus" as having: 1) semantieo-grammatical cate- 
gories dealing with, for example, ways of talking about time, space, motion, 
quantity, case relations, 2) categories of model meaning; and 3) categories of 
communicative function. 



Rational equiry and exposition 
Personal emotions 

Posi'tive 

Negative 
Emotion relations 

Greeting 

Sympathy 

Gratitude 

Flattery 

Hostility 



v Judgment and evaluation 
Valuation 
* Verdiction 
Approval 
Disapproval 
Suasion 

Inducement 
Compulsion 
Prediction 
Tolerance 
Argument 

Information 
Agreement 
Disagreement 
Concession 

h is this third component of notional syllabuses that presents the most problems 
for linguists and course designers ahke. WiJkins rightly points out that up till 
recently, language instruction has focused almost exclusively on the use -of 
language to report and describe. Intuition alone tells us, howevei, that "other 
communicative functions should be taken into account in the design of a lan- 
guage teaching syllabus. Language is also used to get things done, to express 
emotions, to build a rapport, to mark social distance, and so forth. Wilkins 
gives us a broad taxonomy of potential communicative functions and attempts 
to match these functions with a graded schema of syntactic forms which are 
potential surface realization of these functions, for example: 



Information sought: 

— request, question, ask. 



'Question*: 

Information seeking is likely to be an im- 
portant aspect of a learner s languag'e use. 

(a) Interrogatives 

(b) Declaratives, m + question intonation 

(c) Question-word questions 
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When 
Wheiy 
What 
Who 

What '(time) 
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How + 



far 
much 

? 
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'Request' 

Would you shut the window, please 
(Would you mind shutting . . .) 
(— Wilkins 1972) 

Willfes' overall taxonomy is largely based on Austin's inventory of speech 
acts (Austin 1962) which is essentially a list of perfprmative verbs or surface 
structure realizations of speech acts compiled on the basis of introspection. 
3.2 Problems involved in working with a notional syllabus schema. While 
Wilkins' proposal for a notional syllabus represents a Bold step forward in the 
right direction, I feel it raises a number of unfathomable theoretical if sues that 
make it unworkable in practice. There are bound to be problems involved in 
.carrying ever the abstract constructs of speech act theory to the practical t&sk 
.of syllabus design. Not the least of these problems is the danger of over- 
simplifying the relationship between language form and language function. 
Language forms can be multi-functional. Even a simple lexical item such as: 
OK can be construed by the hearer as 1) compliance with a request; 2) the 
granting of permission^ 3) the expression of approval, and so forth. For the 
most part, function can, only be understood with reference to the context of 
the discourse itself and to the communicative context; i.e., the relative status 
of the participants, their goals, background knowledge. 

Candlin (1976) has shown how the assumption that there is a one-to-one 
mapping between speech acts and their surface forms has led to the adoption 
of a static "item-bank" approach to syllabus design. He gives the following 
example: 

The usual procedure is to <ite ( speech ) acts with a performative label and to asso- 
ciate with them a number of different surface structure construction types. . . 
Suggesttot : Why don't you ? 

I suggest that 

We could 

Lefs 

How about ? . 

Can't- we ? - (Candlin 1976:249) 

The important point here is that presenting lists oi lexical items and gram- 
matical structures under the rubric of some loosely defined communicative func- 
tion provides the language learner with no more guidance than a tourist phrase- 
book constructed around ari ad hoc series of situations. The function of an 

utterance which begins with Why don't you ? can only be determined 

with reference to what went before in the discourse and to the appropriateness^ 
conditions for the production of the utterance. Thus, it can only be interpreted 
as a suggestion if a suggestion has previously been elicited, either verbally or 
non-verbalry, by the interlocutors ) who is/ are the referent(s) of you, or, if 
the context is such that the speaker has the authority to volunteer an unelicited 
suggestion. These are the kinds of pragmatic rules that would have to be in- 
corporated in a strictly notional syllabus if such a syllabus is to be devised! 
The other set of rules that 'need to be incorporated are those governing 

ERIC' 

127 



Language Use 



121 



stylistic vacation. Jn the examples of surface realizations of the function sug- 
gestion given above, there is no indication of which constitutes a more or less 
forma] alternative and in what kinds of communicative settings they might be 
appropriately used. 

Auother problem with a strictly notional syllabus* is the question of the 
boundaries of speech acts and their surface realizatipns. A speech act is not 
merely realized as a simple lexical item or utterance. As Coulthard has pointed 
out, in Wilkins' blueprint for a notional syllabus, "there is no indication of the 
size of the communicative acts, whether they are clauses, utterances, exchanges 
or transactions" (1977:141). 

Aside from th$se form-function issues, there is the practical problem of 
what and how many speech act types are to be included in a notional syllabus. 
Speech act theory offers no guidance on this score. Searle has implied that 
there may be some "basic illocutionary acts to which most or all of the others 
are reducible" (1969:69) but he offers few clues as to what these might be. 
The selection of speech acts for inclusion in a syllabus remains a practical 
problem for the syllabus designer. 

Most of the theoretical problems involved in working with a notional 
. syllabus schema stem from the tact that the pedagogical orientation is from 
a single abstract discourse function to its textual realizations. The rules govern- 
t ing the way in which a linguists form or constellation of forms come to convey 
a given communicative function cannot be made pedagogically explicit; they 
are complex rules involving reference to the presuppositional structure of the 
discourse itself and to the communj^ative context as well as to the shared 
knowledge of the interlocutors. 

Quile apart from the theoretical problems involved in working with a 
•notional syllabus schema, it also seemed counter-intuitive to peg the unit of 
a syllabus onto an abstract discourse function. I decided instead to adopt a more 
inductive approach: building the units of the syllabus around samples of spoken 
discourse and emphasizing interpretation rather than explanation. 

As I mentioned at the outset of the paper, this approach is closer to 
* what Candlin has called an "encounter syllabus/', For the students, the emphasis 
is on practice in interpreting spoken discourse encountered both inside and out- 
side the classroom. For the instructor, the goal is not to present ready-made 
interpretations of spoken texts but to explore with the students alternative ways 
in which the same functions might be expressed. The overall purpose of the- 
syllabus is to make students more aware of how the different aspects of speech 
events can govern what gets said and how it gets said. As CandHn puts it: 
Learners need what we might call encounter sylltbvies where they are progressively 
sensitised to, say, Hymesian coniponci *s <>f speech events so that they can them- 
selves analyze and compare the realization >i encounter types. ( 1977-xi ) 
4. The Syllabus Design 

The syllabus is built around a series of verbal encounters or speech events 
which fall into a narrative sequence. Each unit is labelled as if it were a new 
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episode in this sequence. The narrative is centered around two characters: 
a male and a female student who are both arriving for their first quarter at 
an American university. The narrative sequence follows the kinds of com- 
municative contexts and the kinds of language transactions that a foreign stu- 
dent arriving in thf United States for the first time would be likely to be 
involved in, as well as the kinds of topics he or she would be likely to talk 
about. In this way, the syllabus format is designed to reflect as closely as 
possible the potential language needs of a specific student population, language 
needs beii & broadly interpreted in Richterich's terms as stemming from an in- 
teraction of communicative functions with settings, topics and notions, as well 
as the actual grammatical and lexical structure of the language. (Richterich, 
1973) 

Narrative sequence 

The narrative sequence unfolds as follows: 
Unit 1. ARRIVING 

Asking for directions 

Giving instructions 
Unit 2. MAKING CONTACT 

Greetings 

Conversational openings 

Unit 3 MEETING NEW PEOPLE 

Introductions 

Self-introductions 
Unit 4. THE END OF AN EVENING 

Saying goodbye 

Making arrangements 

Unit 5. a Place to live and a bike for sale 

Asking for information 
Making requests 
Unit 6. REGISTRATION DAY 
Asking for permission 
* < Unit 7. AN APPOINTMENT WITH AN ADVISOR 
Advise 

' Making plans etc 

* In its present form, the entire course has fifteen units: the length of the course 
being dictated primarily by practical pedagogical considerations rather than by 
any of the linguistic criteria, discussed above. The narrative theme is a very 
loosely-organized, one, so that the units do not have to be followed in a rigid 
. sequence. Rather, they can be chosen according to the needs and the level of 
the students. The theme does, however, serve a number of useful purposes: 

1) It generates a set of 'texts'' of spoken discourse for interpretive practice; 
2) it provides thematic continuity and a means of linking gie language activities 
based in the classroom to the environment and to the immediate experience of 
the *tuden*s since the episodes portrayed in the dialogue represent the typical 
language transactions of daily life on a college campus; 3) the two characters 
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are portrayed in a wide variety of communicative encounters, both formal and 
informal which serves to highlight the stylistic variation which provides one 
of the main focuses of the course; and 4) the notion of formality is defined pri- 
marily with reference to the norms and oxpectations of the speech communities 
that foreign students are most likely to come into contact with. 
4.1 Notional dimension. A notional dimension is built into each unit. This does 
not mean that the entire unit is built around an abstract discourse notion, but 
one or two speech act types naturally occur in the speech events that form the 
basis of each unit. Thus, for example, in Unit 1, there ar* three samples of spoken 
discourse in the listening component: 

A. At the airport. One of the students has just arrived. She asks how to get to the 
campus. The airline representative gives her instructions as to how to get there 
by bus and then directs her to the bus stop. 

Speech act sequence: Asking for instructions 
Giving instructions 
Giving spatial directions > 

B. Near the campus. The other student is arriving by car. He stops to ask directions 
to campus. 

Speech act sequence: Asking for directions 

Giving spatial directions 
G. On campus. At the student union. The same student asks how he can get ahold 
of a student he knows who lives on campus. He is directed to the registrar's 
office (a complicated route involving going to the third floor of the adminii Na- 
tive building by elevator) and is told who to ask for. 
Speech id sequence: Asking for instructions 

Giving instructions 

Giving spatial directions 

In this way the speech acts or notions are presented as part of a pattern 
of exchanges in the context of a conversational interaction, not as isolated ut- 
terances. These instances of discourse are designed to capture the kinds of 
linguistic routines that are an essential part of a native speaker's communicative 
competence. Verbal interactions on the telephone are typically routinized, thus 
a telephone sequence is built into both the listening component and the spoken 
component of each unit. 

The motivation for incorporating a notional dimension is two fold: 

1) Intuitively and empirically, speech acts represent more natural units of 
discourse; and 2) the lexical labels for speech act types, such as suggestion, re- 
quest, apology, and so forth, form part of a folk taxonomy of communicative 
functions for English speakers. Since students coming to the task of learning 
English as a foreign language have similar, if not equivalent, taxonomies of 
communicative functions in their own linguistic repertoire, it seems only appro- 
priate to draw on this tacit knowledge in some way in developing materials 
tot teaching intermediate/advanced students. ^ 

4.2. Sociolinguistic aspects of speech events. At different junctures in the nar- 
rative sequences verbal encounters of differing levels of formality are introduced 
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within the same unit. The aim is to illustrate how different aspects of speech 
events, such as the identity, status and number o{ participants, the channel, 
the setting, the topic, the motives and goals of the participants, effect the way 
in which people choose to talk to each other. These aspects of speech events 
have been schematized as follows by Hymes: 

S — Setting 
P — Participants 

E — Ends (the motives and goals of participants) 
A — Act sequences ( message content or topic ) 
K — Key (tone, manner or spirit) 

I — Instrumentalities ((a forms of speech: codes, registers, dialects, and 

(b) channel) % 
N — Norms ( norms of interaction of interpretation ) 
G — Genre 

(Hymes, 1972:65) 

The aim of the syllabus is to develop students' awareness of how these factors 
influence the way that people use English in the kinds of speech communities 
they are likely to come in contact with. Broadly speaking, these communities 
have been defined as the American college/university campus and those social, 
bureaucratic and commercial networks typically associated with a campus: from 
the acquaintances, friends, administrative staff and professors a student would 
normally interact with on campus to the people a student would typically come 
into contact with in applying for a driver's license, in buying an airline ticket, 
in visiting a dentist or going around to garage sales. 

These aspects of speech events are brought to the students' attention in the 
first lesson unit. In the student workbook, thev are given the following set of 
questions, based on Hymes' schema. The questions appear on the first pages 
of the students' workbook and they are encouraged to add any other questions 
they consider to be relevant: 

\. Where is this conversation taking place? And when? Does this affect the way 
that people speak? (SETTING) 

2. How many people are taking part in the conversation? Are they the same age 
or sex? Do they have equal status? Or is one of the people more important than 
the other(s)? (PARTICIPANTS) 

3. Why Is the conversation taking place? What does each speaker want to express? 
Does he/she want the other(s) to do something? Is he/she expressing some 
feelings?. Or is he/she planning or promising to do something? (ENDS) 

4. What are these people talking about? Does this affect the way that they talk? 
(ACT SEQUENCES) 

5. Is this a serious conversation? Or are the speakers joking? (KEY) 

6. Is the conversation taking place on the phone? Are the participants speaking m 
a formal or an informal way? (INSTRUMENTALITIES) 

7. Is this the usual way to express this kind of thing? How does this compare with 
the way it would be expressed in your language? (NORMS) 
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8. Is this a special kind of situation, e.g., a lecture, a debate, or it is just an ordinary 
conversation? (GENRE) 

These questions are designed to guide students in their interpretation of 
the instances of spoken discourse they will be encountering both inside and 
outside the classroom, and, to help them judge appropriateness. They can be 
addressed to the class to stimulate a discussion, or to the instructor whose role 
would thus be essentially that of a native speaker informant. Outside class, the 
students ask the same questions to native speakers as part of a systematic exer- 
cise in participant observation and informal discourse analysis, which will be 
illustrated later in this paper. 

4.3. Syllabus content. The syllabus is designed to be as flexible as possible so 
that it can be used vyith studentsnrt varying levels of proficiency and in dif- # . 
ferent programs. Each unit is divided into four phases, then subdivided into 
com and optional components: a listening component, a spoken component and 
a writer follow-up. The sequencing of the phases is illustrated below with ref- 
erence to Unit 3: MEETING NEW PEOPLE. 

m 

PHASE I: Focusing Phase 

Listening/ Spoken Component Discussion of cross-cultural differences In verbal 
(and non-verbal) conventions for: 

(A) Introducing other people 

(B) Introducing oneself — When? to whom? how'** 

(C) Use of terms of address; addressing strangers, attracting people's attention, 
etc. 

(Questions provided in teachers' notes and in student workbook.) 

Written Follow-up: Students ar** asked to write a short paragraph on one of the 
above. 

PHASE II: Intensive Practice 

Listening Component: Students listen to short "texts" of spoken discourse. 
Speech act sequence: Greetings 

Introductions 

Identification of discourse participants 

The texts used here are scripted conversations which are designed so as to 
reflect spontaneous spoken discourses as closely as possible. The conversations 
are marked by false starts, long pauses, interruptions, the use of attention getters 
and other typical features of spontaneous spoken discourse I opted to use 
scripted conversations to illustrate clear, contrastive instances of stylistic varia* 
Hon and to include specific discourse features within a short stretch of discourse 
(about two to three minutes). These proved to be impossible to capture in an 
equally short sample of spontaneous spoken discourse. 

Spoken Component 

There are several suggested activities here, which are appropriate for dif- 
ferent levels of proficiency and for different types of classes. 
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A) Students ask and answer questions about the socio! inguistic aspects of 
the speech events. 

B) Students pick out lexical and structural items that are marked for 
formality or informality. These are placed on a rough continuum of formality. 
(On the board, nrthe student workbook, etc.) 1 

C) Guided practice with new lexical and structural items. The emphksis 
is on authenticity of language activity. ^ 

(t) Communicative "drills th&t provide practice with discourse se- 
quences: In this case, greetings — introductions — identification of discourse par- 
ticipants. Students work through the drills in pairs in class or in the language 
lab. 

(ii) Taskcenteted activities: These are of two types: 

a; Those centered around some non-verbal context, such as a map 
for practicing giving directions in a specific context: inside a building, for ex- 
ample. In Unit 3, this part of the spoken component involves a role-playing 
exercise centered around a map (of a city) and a telephone. The discourse se- 
quence is as follows: 

—self-introduction , 
— acknowledgement 
— invitation 
^ — spatial directions 

b. Those centered around some aspect of language use. For ex- 
ample, in Unit III students a sked to write a formal letter introducing them- 
selves to a professor who is tking in their field. The main objective of the 
Jetter is to ask for copies of several papers that have been written by this pro- 
fessor. The letter opens, nevertheless, with a self-introduction and the students 
aire asked to work in pairs on the opening paragraph. Then, in a similar arrange- 
ment, they are asked to write to a 'friend of a friend* here in the U.S. The 
sequence of the discourse is as follows: 

— self-introduction 

—announcement of impending visit to city where this person lives 
— expression of desire to meet this person. * 

This letter is an informal one. Although this is ostensibly a written assignment, 
it is included in the spoken component because the students are talking about 
stylistic variation, and, simultaneously making suggestions to each other, ac- 
' cepting and rejecting those suggestions and so on. 

Written Follow-up 

Again, in this component, there are a number of different activities sug- 
gested—to be selected according to the students level of proficiency, the type 
of class, the amount of time available, th^homework load and^o forth. 

A) Student workbook: The students answer questions about the soeech 
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events. The questions are designed to elicit answers containing the key lexical 
items and syntactic structures in the "texts" of spoken discourse; e.g., those 
marked for formality/informality, etc. 

B) Student workbook: The students note down those key items that they 
intend to incorporate into their productive repertoire. They can be encouraged 
to use some useful mnemonic such as a flow chart, or stick figures and speech 
bubbles, one set for formal and one for informal interaction. 

C) Student workbook: Students read transcripts of actual samples of spon- 
taneous spoken discourse wfth whole utterances blanked (Hit at certain junctures. 
They then fill in the blanks with what they consider to be the most appropriate 
utterance at that point in the discourse. This can be done in class in pairs, as 
an individual homework assignment, or alternatively students can be encouraged 
to discuss the assignment whh a native speaker outside class. 

PHASE III: Extensive Practice 
Listening Component 

Jphe listening and interpretive activities are essentially the same as in Phase 
II. This is an optional listening phase designed for more advanced students 
and can be done entirely in the language lab with the help of the workbook. 

The texts used in this phase are unscripted conversations. However, because 
of the practical problems involved in obtaining good recordings of the specific 
discourse features and speech act types that I hadfn mind from actual spon- 
taneous speech exchanges, I opted instead to record members of the speech 
community role-playing typical conversational exchanges in a series of pre- 
determined settings. This proved to be an effective way of getting at speakers' 
stereotyped representations of what the ground rules for speaking were in those 
settings. Stereotypes do tend to be somewhat exaggerated, but consistent. This 
approach to collecting samples of natural spoken discourse provides a means of 
eliciting examples of language transactions set within a well-defined context 
and with differing levels of formality. 

Spoken Component 

A) Role-playing practice: This is centered around a series of well-defined 
communicative contexts suggested in the materials for use in class. The role- 
playing activity can be structured in different ways; one fairly productive tech- 
nique is to record the students as they are engaged in the role-playing and ^ 
then to go back and discuss appropriateness of use. 

B) Telephone practice: This exercise is an essential part of each unit. The 
assignment is done in the students' time outside class hours. 

C) Reporting on telephone assignment, e.g., information hunt using the 
yellow pages. 
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D) Discussion and comparison of observations and informal surveys con- 
ducted outside class in the speech community, in Phase IV. 

Written Follow-Up: Open-ended/unstructured. 

A) Student workbook. Questions based on the unscripted conversations of 
» the listening component — students listen to the conversations in the language lab 
and answer the questions. 

PHASE IV r Participant Observation 

This phase is the key to the syllabus. It provides the essential link with the 
speech community outside the classroom. Students are essentially assigned to go 
out and gflther data, to make observations to bring back to class. 

Listening Component 

Making observations of specific aspects of language use in the speech com- 
munity and getting into the habit of taking notes. 

Some of the things that people do with English 
Example: Unit S 

Keep a tecord of all the introductions you observe for a week (including ones; where 
you get introduced to someone). An ideal approach would l>e to follow a host around 
at a party. 

1. Describe the situation. Fonnal/infonnal? Age of people? Place? Describe the 
people. 

2. Did the person doing the intioductions use first names, nick names, last names, 
a tide? 

3. Did anyone shake hands? Who shook hands with who? 

4. What did the person say when he/she did the introduction? 

5. What did the other people say when they were introduced? 

6. What happened next? (Did the first* person explain who the other two people 
were? Did everybody talk alxmt something else?) 

Spoken Component 

Conducting surveys of a) language use and b) language attitudes. Stu- 
dents ask questions to native speakers in the immediate community. 

Written Component 

This involves systematic documentation of observations for comparison 
with other students in class. Questionnaires are provided in the student work- 
book. 

Student aru^tectcher roles 

What are the implications of this kind of syllabus design for language 
methodology? Ob-iously, the focus oh speech events and on working from texts 
of spoken discourse implies that a certain amount of listening practice will be 
integrated with spoken practice: a well-established technique in language teach* 
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ing, but in a different guise, since the main burden of interpretation is on the 
learner. Initially, the role of the teachers would be to ensure that the students 
enter into the spirit of the class right from the start: to make it quite clear what 
kind of approach they will be expected to take toward the language they are 
learning. Students often come to the language clasr with a conservative set of 
expectations about their role and about the kind of learning activities they will 
be engaged in. 

Since the first few classes will set the tone for the course, they should be 
devoted to training students to listen for variation in language use and to view 
language as acts of communication rather than as an abstract grammatical 
system separate from a social context. The following guidelines indicate what 
kind of direction the discussion in the first few classes should take: (the use 
of metalinguistic terminology should be kept to a minimum, although one or 
two key terms/phrases such as appropriateness, communicative junction, Ian- 
guage in use vs. language structure and speaker vs. hearer could be introduced). 

1. What are the functions of language? What kinds of things do we need 
to communicate to other people? 

2. What kinds of actions are only accomplished vviih words? 1 'iat kinds 
of things do we need to do with language? What kinds of thing* iiave you 
already done with language in the situations you've been in today? - 

3. What kinds of expressions are used to do these things with the language 
you are learning? Would this always be expressed in the same way? Why not? 
What kinds of social factors cause people to change the way they make requests, 
the way they greet- each other, the way^they express gratitude and so forth? 

Throughout the remainder of the course the role of the teacher will then 
be: 1) to coordinate the language activities inside and outside the class; and 
2} to act as a native speaker informant. The role of the student will be to 
listen, to practice using the language and, above all, to ask questions about 
the why and the how of doing things with words in English. 

Obviously, no functional syllabus can deal with all aspects of language use, 
any more than a grammatical syllabuf can cover all aspects of the structure of 
a particular language. Course materials can do no more than stimulate and 
guide the learners in the early stages of a long and complex learning process. 
At best, teaching materials can provide students with a set of strategies for 
interpreting the speech events and conversational interactions they participate 
in outside the classroom setting. Students who are living and learning in the 
target speech communities can and should be involved in fairly systematic ob- 
servation of language use in those communities and should be provided with 
guidelines as to what sort of questions they need to ask. 

That is what these materials aim to do. Since so little is known as yet about ' 
spoken discourse and about the ground rules for language use in different speech 
communities, the only option open to the language teacher and syllabus designer 
concerned with language use is to engage the student in something like an in- 
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" formal ethnography of speaking in the classroom and outside; to substitute in- 
terpretation for explam ion, since the latter presupposes that we have some 
established body of knowledge and some set of empirical observations to gu> le 
us; and, finally, to center the interpretive activities around a series of texts of 
spoken discourse which are as authentic as possible. 
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Although there is, general agreement on the importance of developing -peaking 
skills for the elementary school ESL child, the actual techniques for develop- 
ing these oral skills to a level of proficiency are frequently left to intuition. 
Teachers discover that textbook drills do not produce fluent speakers, and that 
the ESL learner who does become communicatively competent does so through 
participation in social situations. 

Current secondary ESL approaches and elementary language arts materials 
for native speakers are synthesized in an ESL Speaking Model which associates 
cognitive processes with different levels of speech behavior. 

Suggested criteria are then, presented for the^seleetion and preparation of 
speaking activities* These include: specification of learning objective and stu- 
dent behavior; topic selection that fits the age, needs and interests of children; 
selection of language structures and vocabulary of greatest utility; timing and 
pacing of speaking activity; audio-visual aids as speech stimuli; incorporation 
of other language skills in the activity; comprehensibility-based evaluation of 
children's oral production. ► 

The .najor part, of this presentation is a demonstration of sample speaking 
activities for each Jevel of the ESL Speaking Model, with emphasis on ap- 
proach and techniques for facilitating those higher levels of communicative 
interaction frequently absent in present commercial elementary ESL materials. 
The structuring of real social encounters within and without the school setting 
is described, as well as the use of community resources to foster the develop- 
ment of communicative competence in children learning English as a second 
language. 

Speaking is a critical social skill (Rubin 1975:120, Logan, Logan, Paterson 
1972:104). In order to participate as a full member of a social group, an 
individuals speech needs to fit that groups criteria of both comprehensibility 
and appropriateness. • ' 

Comprehensibility is, of cqurse, essential if communication is to take place,, 
The listeners attention will not last long if the speakers ideas are not expressed 
in an easily understandable manner. Very young children learning their native 
language have comprehensibility problems; often only the immediate family 
can. understand them (Brown 1973:245). By the age of about three, however, 
they are reasonably successful in making themselves understood outside the 
family circle (Chastain 1976:331). But even school-age children are still making 
mistakes in pronunciation, grammar, and word choice— which their peers 
mercilessly correct. When the school-age child learning English as a second 
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language makes inevitable mistakes in comprehensibility, native English-speak- 
ing peers often express complete intolerance. The reaction of the ESL learner 
may vary from total rejection of the language the peer group represents to a 
strategy of speaking as little and as infrequently as possible (Rubin 1975:112) 
or to a determination to master English and be accepted by the peer group. 
Which attitude is taken is certainly due in some part to the sensitivity of the 
ESL teacher to the comprehensibility needs of the ESL learner. 

In addition to being comprehensible, speech needs to be appropriate to 
the social situation. A speaker who seiects words and expressions that are either 
too formal or not formal enough for a given occasion will receive a perhaps 
subtle but nonetheless negative reaction from the listener. Young children learn- 
ing their native language also have to learn about social appropriateness (Hymes 
1974:75). For instance, in languages like Spanish and French, they have to 
learn by school age that some people must now be addressed as usted or vous 
rather than tu or tu. Children soon learn that some words and expressions which 
can be freely used with peers brings swift retribution if used in the hearing of 
teachers or parents; as they grow more sophisticated socially, they learn which 
speech styles are most effective in different situations., 

In selecting appropriateness of language, the ESL learner encounters even 
greater difficulties than the native speaker. Once a reasonable degree of com- 
prehensibility and acceptance within the peer group has been achieved, the 
ESL child may not realize that additional styles of Englisl^eed to be acquired 
in order to communicate successfully in other kinds of social encounters. The 
ESL teacher needs to provide experiences designed to help the learner choose 
the speech style that is appropriate to a variety of situations (Taylor and 
Wolfson 1978.35). 

Although ESL textbooks and curriculum guides provide many exercises 
designed to develop speaking skills, teachers often find that these do not neces- 
sarily lead to real speaking fluency, particularly if the emphasis is on pattern 
drills (for a neurofunctional explanation of the failure of pattern practice drills, 
see Lamendella 1979:5-18) At least two reasons can be found for this discre- 
pancy between goals and actual outcomes. First, many ESL speaking activities 
are in the form of drills which are too often irrelevant to the child's real ex- 
perience, so the child does not perceive the in as in any way related to real 
communication (Chastain 1976:333;. Second, most ESL materials contain ex- 
tremely few activities designed to bridge the gap between manipulating the 
language with a fair degree of comprehensibility, and communicating through 
the language with social appropriateness (Rivers and Temperley 1978:15). 

The purpose of the speaking activities described at the conclusion of this 
paper is to show how the relevance and interest of speaking drills or language- 
practice activities can be enhanced, and to suggest language-use activities to 
help the child become communicatively competent through use of appropriate 
language in different social situations. 

The skill of speaking has several different components. In order to com- 
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municate an idea, the effective speaker has to select the specific words that 
convey his or her semantic intention, arrange them and modify them accord- 
ing to rules of grammar, apply soHal and situational rules to the grammatical 
sentences, ^nd apply phonological rules that will make the sentence compre- 
hensible to the listener. 

The second language teacher helps students acquire these different com- 
* ponents of the skill of speaking in a variety of ways. As primary source of lin- 
guistic input, the teacher provides a model of speech against which children 
can match their own production. At the same time, the teacher needs 'to be 
aware of other models of English that children hear: in the community, on 
radio and television, from the peer group. Peer models especially provide input 
on the kind of language needed for successful social interaction; it may be in 
the form of a dialect different from* the teachers own, but should not be viewed 
critically on that account (Chamot 1979). The ESL learner will eventually haye 
to learn a number of styles and registers, usually he/she starts with an informal 
one appropriate principally with other children of the sstme age. 

1. An ESL Speaking Model 

Foreign language methodologists generally propose a hierarchy of speaking 
skills ranging from easiest to most difficult (Chastain 1976:351-355). Typically, 
they describe the lowest level of speaking skill as the ability to imitate words 
and sentences from a spoken model At a higher level is the ability to manipulate 
portions of a language segment so ihat sentences are formed. The teacher's 
direct control is lessened as student* increase their ability to create their own 
sentences, so that eventually they can t perate successfully in a free communica- 
tion situation. This last is usually fbe highest level which the foreign language 
learner is expected to rvuch (Chastain 1976-335). 

However, for the child learuhg English as a second rather than as a 
foreign language, the goal of c mprcheusible speech used communicatively 
can be characterized as a midway goal. The native speaker of English comes 
to school communicatively competent in a particular social environment, and 
in the elementary school language arts program this basic competence is ex- 
tended, amplified, and eteljorated. The learner of English as a second language 
must do the same in or Jit to part'eipate fully in both the English speaking 
environment of school and the larger community * 

A synthesis of foreign language speaking skills hierarchies and of the 
types of speaking activities common to language arts programs can provide a 
sequence of speaking skills for the ESL child. A proposed hierarchy of speaking 
skills that provides a sequential plan for teaching speaking in an elementary 
school ESL program is shown in Table 1. 

The first level skill involves imitation of single words, phrases, and sen- 
tences. The learner practices pronunciation, intonation, and grammatical struc- 
ture. 
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TABLE 1 
ESL Speaking Model 


Level 


Cognitive Process 


Student Behavior 


6 


Discourse 


Use appropriate speech style in different 
social situations. 

Use appropriate speech style for different 
purposes. 


5 


Communication 


Communicate in the community. 
Plan a class activity 
Explain a process. 
Conduct an interview. 


A 


VUKJCxl 

Communication 


Participate in structured discussions. 
Give directions. 
Present oral reports. 


O 




Ask and answer naa^tyflLdiJuestions. 

Role play with known s^uences. 
Present recombined information. 


2 


Variation 


Produce recombination sentences. 
Produce recombination sequences. 


1 


Imitation 


Produce contrasts in words. 
Produce contrasts in sentences. 



At the second level, the leamer varies given sentences and longer sequences 
in order to express different meanings. 

The third level requires the leamer to use the speaking skill acquired to 
interact with classmates in several ways: by asking and answering questions 
that have personal meaning, by using known sequences to engage in role play- 
ing, and to present recombined information. 

At the fourth level, the learner engages in activities which have been 
structured by the teacher in order to elicit communication in the second lan- 
guage. Various communication games and activities are a feature of this level. 

By the fifth level, the student is expected to use the communicative skills 
practiced in class in the larger social context of the community (as proposed 
by Corder 1977:10), and to use speaking skills in class without elaborate struc- 
turing by the teacher. 

The sixth level calls for the student to make appropriate shifts in style and 
register depending upon the variables of social situation and purpose of speech. 
At this stage, the teacher s task is to sensitize students to the importance of 
appropriateness In speech style (Rivers and Temperley 1978:11), but real 
acquisition of a variety of speech styles must necessarily occur individually as 
the student encounters different social situations beyond the classroom. 

The student behaviors described on the Speaking Model represent sonw of 
the types of activities that are typical of the cognitive processes specified, but 
are not intended to be all-inclusive. This Speaking Model can be used to select 
and classify different kinds of speaking activities so that a teaching /learning 
sequence is established. The levels are generally hierarchical, that is, each 
builds on the preceding ones. However, there will usually be some elements of 
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the behaviors described at the Disc irse Level present in lower levels because 
examples of socially appropriate speech wilf inevitably form part of the child's 
linguistic input from the very beginning. The hierarchical arrangement does not 
impl)*that all speaking activities in a class will be at a single time; on the 
contrary, only newly introduced language segments will be practiced at the 
first two levels, for as soon as children gain familiarity with a segment, they 
should engage primarily in activities at levels three and above. 

2. Criteria for Speaking Activities 

Criteria for selection of speaking activities need to be established so that 
maximum learning takes place. The first important criterion is the identification 
of the learning objective (Logan, Logan, Paterson 1972:107). By specifying 
the expected outcome of a given speaking activity, the teacher guards against 
those which are- merely busy work or recreational. The Speaking Model can be 
used to clarify learning objectives by identifying what the child will do during 
an activity, and then matching the behavior to the cognitive process it reflects. 
For instance, if children will be learning a dialogue, then the learning objective 
will be found at Level 1 of the Speaking Model, bat if they will be improvising 
a dialogue using known words ai.* 1 structures, then the learning objective will 
be found at Level 3. Objectives at the higher levels of the Speaking Model 
emphasize communication of message rather than correctness of form. Learning 
objectives need to be communicated to the children as well. When* they know 
which speaking skills are being practiced, they will be more likely to monitor 
their own output effectively. 

A second important criterion has to do with the content of the speaking 
activity. Content should be relevant to ;he age of the child; although an older 
beginner may need the same kinds of practice that the younger one does, the 
material that is used for practice should be more sophisticated. 

A third criterion involves the length of the speaking activity. In general, 
speaking activities on the lower levels of the model need to be kept quite 
short and repeated after intervening activities of different types. Practicing at 
the manipulative level of language easily becomes boring, so the teacher needs 
to vary the type of activity frequently. Communicative activities at the higher 
levels of the model can last considerably longer because the focus is on content 
rather than form, and children's interest is thus more easily maintained. 

A fourth criterion has to do v 1 model of linguistic input on which 
the child will be basing his or he .utput. Speech input should be clear, 

easily understood, and free from ^. .cttons such as mannerisms, overuse of 
idiomatic expressions, incorrectness, and unnatural pace. Teachers need to 
monitor their own speech to be sure that it is an acceptable model for children 
learning English (Rubin 1975:109). 

The fifth and most important criterion has to do with the meaningfulness 
of the .speaking actis ity. Although meaningfulness is built into the higher level 
communicative activities, this is not always the case wjfkthe language-practice 
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activities at the lower levels of the Speaking Model. Too often manipulative 
exercises become mechanical and detached from even the most tenuous contact 
with realistic communication. In order to prevent this from happening, a con- 
text for every shaking activity needs to be provided. This means that every 
sentence for repetition and every variation drill needs to be set either in a real 
situation or an imaginary one where the utterances being practiced could occur 
naturally. A real situation can often be established through a game which 
establishes a purpose for language-practice drills An imaginary situation can 
be set up by the teacher with context provided through description, pictures, 
and real objections. When manipulative level activities are thus contextual ized, 
children can more easily perceive the" language being practiced as a real part 
of the communicative process; 

The sfitth criterion is an outcome of the first, for the learning objective 
set for a speaking activity will determine how it is to be evaluated. If the 
learning objective is at Level 1, then the teacher will evaluate pronunciation 
and intonation. At the next two levels the teacher may be concentrating on 
grammatically and vocabulary, while at the communicative levels, the focus 
will be on content and meaning rather than on form. The same type of speaking 
activity may be evaluated for different aspects on different occasions. While 
comprehensibility may be the main objective for a considerable time in the 
child's acquisition of English, the further goal of social appropriateness also 
needs to be evaluated. 

Criteria such as these help teachers plan speaking activities that can be- 
come a purposeful and central part of the ESL curriculum. 

3. Sample Speaking Activities 

Representative speaking activities for each level of the ESL Speaking 
Model are described in this section. Except for two general suggestions for 
activities at Level 1, all of the specific activities described call for individual, 
paired or small group speaking experiences. Theae call for classroom organiza- 
tion in which the teacher plans for several different things to be going on 
simultaneously, rather than having total group practice of identical spoken 
patterns. Although this type of teaching requires detailed planning, it allows 
for more individual responses and more frequent opportunities for all the chil- 
dren to engage in group conversational practice (Chastain 1978:357). The 
teachers role becomes that of model, initiator and facilitator, and student- 
student interaction is actively cultivated. When the teacher organizes language 
learning experiences systematically, an environment is established in which 
children can engage in the creative construction process of second language 
acquisition (Garcia and Gonzalez-Mena 1976.4). 

LEVEL 1— IMITATION 

When imitating the teachers model, the child needs not only to under- 
stand the meaning of the language segment but also to feel that there is a 
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good reason for imitating it. This can be difficult to provide, for in real life 
we do very little oral repetition of what someone else says. 

Suggestions for making group repetition more enjoyable can be found in 
'innovative approaches such as chanting (see Graham 1979) and in choral speak- 
ing activities using nursery rhymes and children's poetry' (see Rubin 1975:123- 
128). Material for choral speaking for the ESL child, however, needs to be 
selected carefully so that it is not only meaningful but also has standard spoken 
syntax and useful vocabulary (i.e., poetic license, archaisms, and rare structures 
are not useful models for beginning ESL speaking practice). 

For individual repetition, small group work is essential. The teacher work- 
ing with six or seven children .can provide a meaningful context, encourage 
each child to participate, and give individual attention to each child's spoken 
output. The most fluent children will be asked to imitate the teacher's model 
first, thus serving as additional models for the less fluent children. 

One suggestion for younger children is the use of a toy animal or puppet. 
The teacher starts the imitation activity by showing the toy, and the speaking 
sequence goes something like this: 

Teacher: This is my pet dog, Snoopy. He doesn't obey me! When I say, 
"Snoopy, sit down," he doesn't sit down. Qo you think Snoopy 
will obey you? Let's try! I will tell you what to say. Ready? 

Child: (Nods.) 

Teacher: Snoopy, stand up. (Keeps toy dog in sitting position.) 
Child: Snoopy, stand up. (Teacher makes dog stand up.) ' 
Teacher: Snoopy, lie down and turn over. (Keeps dog in stancKng posi- 
tion. ) 

Child: Snoopy, lie down and turn over. (Teacher makes dog do so.) 
After the initial practice, a child can manipulate the toy, having it move only 
when another child imitates the teachers model. 

A suggestion for older children requires the preparation of a short tape 
with either spelling words or a narrative on it, and with repetition pauses after 
each segment. A group of children sit at a table with pencil and paper or work- 
sheet. One child listens to the tape through a headphone, and repeats each 
word or phrase on the tape during the pause provided. The other children 
must write down what is being dictated to them. In the case of a longer 
narrative, this may be a cloze dictation in which children have a worksheet 
With part of the text on it and must fill in the missing words as they hear 
them. Different children can take turns dictating the taped material. This 
activity motivates children to repeat as accurately as possible in order to help 
their peers write the exercise correctly. 

LEVEL 2— VARIATION 
At this level children are encouraged to vary sentences by adding or chang- 
ing parts of tlit^li. Here again, coutcxtualizatiou and individual responses are 
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preferred so that children can make meaningful rather than mechanical varia- 
tions. 

Younger children can derive continuing* enjoyment from a mystery^ box 
with ever-changing contents (Rubin 1973:141). The teacher can review voca- 
bulary as different objects ( realia, . toys, miniatures, doll house furniture, etc.) 
are dropped into the mystery box. Then children take turns dipping into the 
box fcnd having others guess what each is holding inside the box. The dialogue 
can go something like this: 

Child 1: (Dips into the mystery box and picks up an object without 
showing it.) Guess what 1 have! 

Child 2: Do you have the car? 

Child 1: No, I don't. 

Child 2: Do you have the chair? 

Child 1: Yes, I do. 

For older children, a strip cartoon can be modified to serve as a context 
for variation practice. Choose a cartoon sequence that has a question-and-answer 
dialogue, and cut away either the qu^stiQns or the answers. It may be necessary 
to re-write the remaining text so that the structures and vocabulary are known 
by the children. A transparency of the modified ' strip cartoon isf then shown, 
and children are asked to provide the missing bits of dialogue from the clues 
givpn by the drawings and the text. 

LEVEL 3 — INTERACTION 

At this level, the emphasis is on interaction between children, and groups 
or pairs can work together at the same time (see Mellgren and Walker 1979: 
P-8). 

Children can learn to ask and answer meaningful questions that are per* 
sonalized. The teacher can suggest, orally or in writing, three or four questions, 
such as: 

When is your birthday? 

What do you want for your birthday? 

What arc you going to do on your birthday? 

Each child then interviews another child in the classroom, and when all the 
interviews have been completed, children take turns presenting the information 
discovered to a group. 

Simple riddles can also be used for interaction practice. Humor is a difficult 
aspect of a second language to acquire, but simple plays on words and jokes 
(the cornier the better) can be a good beginning point ( Trachtenberg 1979:91). 
Many riddles follow a very simple format, and once this is learned, .children 
with even limited knowledge of English can invent their own and have their 
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classmates guess the answers. The teacher may need to rewrite riddles to make 
the structure and vocabulary within the reach of the ESL child, but many of 
the classic descriptive riddles ("What is [adj.], and [verb]?') can be used 
- without simplification. 

LEVEL 4 — GUIDED COMMUNICATION 

At this level, children engage in discussions and problem solving activities 
that are structured by the teacher. In communication activities, the content of 
the message being transmitted is more important than its form, hence teacher 
corrections should not be given during the activity (Chastain 1976:337). Many 
of the discussion activities recommended for regular language arts classes can 
be used at this level. 

Using a map to give directions from one point to another is an activity 
that can fit into the needs of both younger and older children. The map can 
be of the classroom, of the school, of the neighborhood around the school, or 
of the whole town. It can be reproduced on the chalkboard, a transparency, 
a poster, or a playing board. It can even be drawn in chalk on the floor of the 
classroom so that children can walk from one point to another as they give 
directions. A neighborhood map can be used to give directions to and from the 
home of each child, and a city map can give children practice in learning how 
to get to major points of interest. 

An activity for second grade and above has the teacher ask children to 
think of a place each would like to visjt, but to keep it secret. Each child then 
writes down five things needed for the trip. The class is divided into small 
groups and children take turns telling what thty will need. Others in the group 
ask why each thing is necdutip MWB £n appropriate reason has to be given. 
After all five things have been questioned and explained, group members begin 
guessing the destination (Yorkey, Barrutta* Chamot et al. 197iH29). 

Croup decision making activities encourage participation in discussion 
(Mountain 1974:24) and in the ESL class can provide speaking practice in a 
natural context. In this type of activity, a problem is first set up by the teacher. 
Individual children record solutions to the problem, then the class is divided 
into small groups which discuss the individual solutions and eventually reach 
a group consensus. Evaluation of the solutions is provided through an authorita- 
tive source, and children usually find that the group decisions are closer to the 
correct answer than are most individual solutions. A* an example, the different 
steps of a recipe for brownies can be presented in incorrect order, and the 
task is for children to arrange them in correct order. After individual children 
have put the steps in order, they join ^flroup to discuss a final order. The 
teacher can then present the actual rec (Be for comparison, and as a follow-up 
the class can engage in an actual cooking experience. 

A guided communication activity for middle grade students with some 
reading ability in English is called Information Exchange. In this activity, the 
class is divided in half. One half works on an assignment with the teacher or 
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individually. The other half is given a worksheet to read carefully; the content 
of the reading passage can deal with any subject area relevant to the class 
(science and social studies passages are good choices). After the material has 
been read, the teacher collects the worksheets and each student who has read 
the information must now report it to one who has not. After the paired discus- 
sions, students who did not read the information are given a worksheet with 
questions to answer about the material, while the teacher works with the other 
half of the class. This activity encourages students to report accurately and to 
question anything not understood clearly. 

LEVEL 5— COMMUNICATION EXPERIENCE 

Activities at this level provide experiences in which children use English 
for communication in a real, rather than a contrived, situation. The ideal ex- 
perience is for children to engage in communicative encounters in English in 
the community. The teacher can encourage this type of encounter by keeping 
a Communication Journal'for younger children, and, by having older children 
keep their own journal. In it will be recorded the date and description of the 
communicative encounter, with comments on its degree of success. These jour- 
nals should be shared with the teacher and with other children on a regular 
basis. A Communication Journal can help motivate a child to actively seek to 
use English outside of school, and can provide the teacher with insights irfto 
the child's ability to cope in a free communication situation. 

Classroom centered communication experiences can center around group 
planning for a class activity such as a party or picnic. Children can volunteer 
to work in groups to plan each aspect of the class activity before the final plan 
is synthesized and implemented. 

Another communication experience which can take place in or out of the 
classroom involves conducting an interview with a person whose job or hobby 
is of particular interest to the group. Suggestions for interviewing preparation, 
techniques, and follow-up can be found in most language arts teaching guides 
(see Logan, Logan, and Paterson 1972:118). The basic procedure calls for 
selection of the person to be interviewed, writing the interview questions, and 
practicing interview procedure. 

Many communication experiences draw on the individual interests and 
talents of children. An example is one in which each child explains a process 
to a group of classmates. Discussion will yield information about special abilities, 
which can range from building models, cooking, playing an instrument, ironing 
a shirt, making a toy, etc. Each child should bring any materials necessary to 
explain and demonstrate how to do something that he or she does well. In addi- 
tion to providing speaking practice in English, this activity posters a positive 
self-concept and pride in achievements that are not necessarily school-related. 

LEVEL 6-DISCOURSE 
As mentioned earlier, there are elements of discourse at every level of the 
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Speaking Model, for as soon as a speaking activity is contextual ized, the lan- 
guage used in it will be appropriate to the social situation in which it occurs. 

Role-playing experiences can help children become awarfc of different 
speech styles and registers to use according to t\\e situation portrayed. In order 
to set up realistic situations for role-playing, the teacher needs to be informed 
about the kinds of social encounters the ESL child will typically have to deal 
with in a given community. Useful suggestions for role-playing situations can 
be found in language arts methods texts (see Rubin 1975:132-134), but to 
these the teacher should add situations that are part of the daily life of the 
ESL child. Familiar situations might include: being lost and having to tell an 
adult; damaging or breaking something and having to explain and apologize; 
being sick and having to go to the doctor, or accompanying another family 
member to the doctor and having to explain the nature of the illness; getting 
in trouble at school and being sent to the principal; asking a policeman for 
help; using language to avoid difficulties with policemen and other authorities; 
watching television at a friend's house; getting into a fight with another child 
who speaks only English, babysitting, buying something, getting a prescription 
filled; telephoning for information. After deciding on the situation, children 
choose the characters that will take part in it. The teacher should discuss with 
children the feelings of the characters involved iji the role-playing situation, and 
their probable style 'of speech, whether formal or informal, friendly or antagon- 
istic, simple or elaborate. Props add greater reality to role-playing and also 
help children enter into the characters of the persons they are portraying. These 
can be as simple as a hat, badge, handbag, briefcase, stethoscope, whistle, note- 
pad, or toy telephone. After the situation is role-played, children can discuss 
the eHectiveness of the kind of speech style each character used and make sug- 
gestions for changes. 

As children become aware of the uses of different styles of speech, they 
will begin to use language not only to communicate ideas and feelings but 
also as the means to social survival. 
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Increasing numbers of international students are coming to the United States 
to study Engineering; in fact, Engineering is the most popular field of study 
among international students at American colleges and Universities (HE Re- 
ports, 1978). This increase creates a need for effective materials and methods 
for teaching technical English to international students. 

This paper describes an application of the Personalized System of Instruc- 
tion (PSI) to the teaching of technical writing to intemational students. PSI, 
as used in this course, is distinguished by: 1 ) mastery learning and the |^e of 
small units which specify what the student is to learn, 2) unit study guides 
which include writing assignments, 3) frequent evaluation, 4) self-pacing which 
allows individualized time for learning, and 5) the use of proctors to maximize 
individual attention and assistance avaik ble to each student. 

Engineering is the most popular field of study among international stu- 
dents at American colleges and universities (HE Reports 1978). Between 27* 
and 34% of alt engineering Ph.D.'s from American academic institutions are 
earned by international students (The Chronicle of Higher Education, 1978). 
West Virginia University is one of the many universities that reflect this in- 
crease in international students, with approximately 350 international students 
of engineering, about half graduate students. 

In the summer of 1977, the University investigated the feasibility of expand- 
ing its EFL program to accommodate the growing number of international stu- 
dents. Faculty and intemational students were intervifwed regarding ESL needs. 
In-depth interviews *vere conducted with the engineering faculty and students ^ 
due to the large number of international student* studying the subject. The fac- 
ulty consistently commented on the international students* low level of English 
skills, particularly their low level of writing skills in lab reports, research papers, 
letters, and theses. The international students also admitted having writing 
difficulties, especially those of conciseness on essay exams and lab reports. 

In the fall of 1977. representatives from each engineering department met 
with the ESL faculty to confer in the development of a technical writing course. 
The engineering faculty supplied copies of student reports, course requirement^ 
and textboqks; all were used in the course. They also reviewed technical writing 
textbooks, commented on course construction, and eventually advised their stu- 
dents to take the course. In selecting a course design, two major concerns were: 
1) meeting the individual needs of students with differing- English levels and 
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problems; and 2) directly relating the technic* . writing course* to students' 
engineering courses so that it would provide a quick and noticeable improve- 
ment in the quality of their writing. It was primarily because of these two 
concerns that the Personalized System ot Instruction (PSI) was selected. 1 

At that time, many of the additional benefits of using PSI m an ESL class 
had not been fully realized. 

PSI, as used in technical writing, is distinguished by: 1) mastery learning 
and the use of small units which specify what the student is to leam; 2) unit 
study guides- which include writing assignments; 3) frequent evaluation, 4) self 
pacing which allows individualized time for learning; and 5) the use of proctors 
to give individual attention and assistance to students' particular problems, the 
first concern of differing English levels. The self-pacing component of PSI also 
allows for differing English levels, the advanced student can finish early and 
the less-advanced .student can take more time and receive additional help from * 
the proctor. In writing the study guides, material from the WVU College of 
Engineering was used so that the questions and wiitiag assignments related to 
the students engineering courses, thus meeting th^second concern. Finally, the 
unit mastery component contributes to b(?th concerns by insuring that each 
student attains a high level of«iompetence with the subject matter in each unit 
and eliminating the problem of cumulative failure, encountered when students' 
failure at early -course material inhibits their performance on later material. 
Given the concern for rapid, noticeable Improvement, the requirement of unit 
mastery provides an assurance of improvement in quality through the high 
mastery criterion and frequent performance evaluations. 

PSI has been the topic of hundreds of published research rticles and the 
(A-erwhelming majority of these studies provide convincing evidence that PSI 
has been significantly more effective than traditional approaches. Several re- 
views have indicated that average student performance ranges 9-15% better than 
student performance in traditionally taught courses (Hursh 1976, Kulik, Kulik, 
and' Smith 1978). Research has demonstrated that students in a PSI course re- 
tain more material when tested months later. In addition, it has consistently 
been shown that although students report that a PSI course requires more work, 
they also report enjoying the course more than others (Kulik, Kulik, and Smith 
1976). PSI has been used successfully in 4 wide variety of disciplines, from the 
"hard science" duciplmes of engineering and physics to the abstract fields of 
poetry, philosophy, and English (Hursh 1976). 

1. Course Construction 

The technical writing course is divided into twelve units— four grammar 
units and eight units based on the text, Reporting Technical Information by 
Kenneth Ifwip and Thomas Pearsall (1977). Kach unit is composed of a reading 
assignment. 20-30 study questions on the leading, appropriate writing assign- 

1 For additional information, contact the Confer for Personalized Instruction, Georgetown 
University, Washington, IXC, 20057. 
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ments, and three equivalent forms oi a unit quiz. After .students complete 
both the written assignments and the study questions, they take a quiz. A 
score of 90-100$ on the quiz, indicates mastery of the material reflected on a 
cumulative progress chart which is attached to the students' quiz folders. If 
they do not obtain 90?, they resume studying and when ready, take a different 
f<>rm of the quiz without penalty (Figure 1). 

2. .Study Guides 

In using PSI to teach ESL, quiz mastery marks a student's passage to the 
next unit, but it is not the sole focus of the course. An essential component is 
the student's written work in the study guide. Each study guide contains 20-30 
questions and problems from the unit reading assignment that necessitate stu- 
dent implementation of material, not memorization. An example of such im- 
plementation is: 

Your company has recently developed a new deep water oil rig which utilizes a 
completely new system of drilling. Your job is to write a report on the subject which 
will be read by the president and board of directors. How much can you assume 
these people know abo* * deep water drilling Why? What specific tilings alxnit the 
new \*l rig would you w «nt them to know. 

Each study guide also includes several writing assignments that students com- 
plete and discuss with a proctor before taking the unit qui;/. The following 
writing assignment from the unit on prose elements of a technical report serves 
as an illustration: 

1. On a separate sheet of paper, write a letter of transmittal to one of your pro- 
fessors to whom you are submitting a paper. You may use the form in the text- 
book or check the fo^ii that your department recommends Show this letter to 
a procvor before taking a quiz. The letter should be neatly written. 

2. Write a descriptive abstract for a paper that you have written. You may refer 
to your departmental guidelines. Show the abstract to a proctor before taking a 

- Quiz c 
X Write a statement of purpose for a report tnat you are writing or tor a report 

that you plan to write. Show the ^atement of purpose to a proctor. 

* The study guides move from tight control to little control of student responses, 
from rule identification to rule implementation, from dchh'r.g an abstract to 
writing an abstract. 

3. Proctors d 

The proctors are graduate and undergraduate nauve FngMsh speaking stu 
dents who are studying to become ESL instructors. Th< v are reimbursed with 
credits in fields such as ESL Methodology, English, or Education. As shown 
in Figure 1. the proctors play a vital role in a PSI wiiting course, with three 
main responsibilities: 1) eflectively discussing student work, 2) correcting 
quizzes, and 3) recording progress on the students' progress chart. 

Discussing their written work with the student' is the most important and 
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inost difficult aspect of the proctor's task. Rather than simply providing the 
correct answers, their role is to ask questions that help the students arrive at 
the corrections themselves. Examples of such questions or responses are: 

Can you think of another way to say this? Read this carefully t nd try to figure out 
what's wrong. What tense are yon hsmk? Is thi. word countable or uncountable? 
Check the spelling of this 
Rather than simply telling the student the correct answer, the proctor prompts 
the student to supply the answer, thus considerably improving student per- 
formance and retention of material (Johnspn 1978). 

Proctors also make note of ambiguous study guide and quiz questions, * 

* rewriting them for the next class, the course is continually being revised. They 
also refer students to the text or to supplemental material when necessary. An 
additional function of the proctor is to help students with technical reports 
they are concurrently doing in their engineering classes. If proctoring is to be 
effective, the proctor student ratio needs to be small, about 5:1. 

£. Conclusion 

The success of the technical writing course can be measured by student and 
faculty evaluations, pre test and post-test scores, and improved writing in cngi- 
neeriug courses. The course has been taught three times so far, with approxi- 
mately 20 students in each class, student evaluations have been consistently 
high. The students are very enthusiastic about the method, the self-pacing, 
the large amount of information they learn, and especially the learning of 
various forms of writing— types of letters and reports. These high evaluations 
are consistent with research on PSI: Kulik, Kulik, and Smith (1976) report that 
students rate PSI courses significant higher than traditionally taught courses. 

The pre-test and post-test includes one question fiom each of the twelve 
units and asks the students to write a letter applying for a giaduate assistant- 
ship. A simple t-test tor repeated measuies indicates a significant increase in 
scores between the pre-teSt and the post-test (p < C1). In comparing the first 
and second letters, the improvement in overall writing skills is immediately 
noticeable their aie fewer errois in gi annual, lewei mistakes in the use of 
tense, agreement, articles, prepositions, even spelling Also noticeable is the use 
of coir "iitions, such as the appropriate fonn of the lettei, abbreviations, capital- 
ization, dates, spacing, and style. 

At the completion of the first couise engineering faculty were asked to 
comment on any Impwn ement in student w ittng, the replies were over- 
whelmingly positive. The Depaitment ol Chemical Engineering, tor instance, 
began requiring all their international giaduate students to take the course and 
other engineering depaitment s aie considering the same. 

♦ ; An additional feature ol this endeavoi wai rants further comment The 
success of the PSI technical wilting couise at West Virginia Uimersity is in 
large pait a icsult of the ( ollaboiation of thiee t\picall\ unrelated fields: 
English as a Second Language, Engmeeiing, and Insti fictional Psxehology, By 
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incorporating information from each of these three disciplines, the content, 
design, and relevance of the course was greatly improved. This inter-disciplinary 
approach seems particularly important to university ESL instructors who are 
teaching English to students whose prir ry field of study is something other 
than English. 
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Rhetorical Competence and EST Discourse 

G#org« R. Htpworth 

Bilingual Education for Choctaw* of Mississippi 

ESL teachers are aware that effective communication often depends on recog- 
nizing when the actual purpose of an utterance is different from that suggested 
by its surface form. Implicit functions are, therefore, a frequent source of dif- 
ficulty for ESL speakers. Unfortunately, however, the importance of this fact 
is too often overlooked for written discourse. Texts used in teaching English of 
Science and Technology (EST), for example, tend to focus on the specialized 
vocabulary rather than the rhetorical structures and functions of EST discourse. 

This paper presents the results cf an analysis of EST discourse using a 
rigorous method of description- 1 - the Discourse Bloc Model, a model which 
identifies units of discourse by their functions rather than by their forms. When 
the Introductions to Experiment Reports from several fields of. science were 
looked at, the resulting description identified the rhetorical functions employed 
consistently. An idealized version of the Introduction to an Experiment Report 
can thereby be proposed. The predicted correspondence between, implicit', 
rhetorical functions and explicit discourse functions also was found to exist. Thfc 
term rhetorical competence is used to refer to the ability to recognize this use 
of discourse functions in EST. ^ 

Communicative competence has become one of the more important ideas 
in ESL pedagogy in recent years (Paulston 1974, Rivers 1976). In one excel- 
lent definition of communicative competence, Savignon (1972) says< 

Communicative competence may be defined as the ability to function in a truly com- 
municative setting, that is, in a dynamic exchange in which linguistic >mpetence 
must adapt itself to the total information output, both linRuistic and puralinguistic, 
of one or more interlocutors. (Emphasis my own.) 

This definition, which focuses on the essential role of paralinguistic factors in 
the proper interpretation of, some utterances, also serves to illustrate the cur- 
rent preoccupation with spoken, as opposed to written, discourse in most dis- 
cussions of communicative competence. 

In this paper, ' I discuss written discourse, more particularly discourse 
written in the English of Science and Technology (EST). I first attempt to 
show that, in much the same way that oral communicative competence often 
involves recognizing the difference between the explicit functions of certain 
utterances* and the implicit functions understood for those utterances, the 
* rhetorically competent reader of EST discourse recognizes when certain ex- 
plicitly-signalled discourse functions, sucli as coordination, have other, implicit 

I wish to thank John Lackstrom anc* Will Pitkin for vaii^hk comments on an earlier ver- 
sion of this paper. Any errors or deficiencies are, of course, my own. 
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functions necessary for the proper interpretation of that discourse. I will intro- 
duce a method of discourse analysis known' as Discourse Blocs, originally de- 
veloped by Will Pitkin (1969, 1973), in order to provide a framework for this . 
discvision. 

1. Implicit and Explicit Functions 

One (actor in the proper interpretation of utterances is the recognition of 
implicit fruitions of certain sentence types. Sentences of the type "Do you have 
X?" have in normal conversation the force of requests. Consider (1) and (2): 

( 1 ) Do yo\i have the correct time? 

(2) Hey, got the time? 

For our purposes, it is not necessary to indicate the exact ways in which (1) 
and (2) are understood as requests and not as yes/no questions, that is, to 
have implicit communicative functions. 1 However, it is necessary to realize that 
the communicatively-competent speaker of English would reply to (1) and (2) 
with (3) and not with (4): 

(3) Yes, it s about 9:30. ; • - 

(4) Yes, I do. 

2. The Functions of Written Discourse 

Our 1 ypothesis is that in some written discourse, such as that of EST, there 
is an analogous situation. Explicit discourse functions— such as causality and 
contrast — may also serve other, implicitly determined, rhetorical functions spe- 
cific to the type of discourse in which they appear. For example, Lackstrom, 
Selinker and Trimble (1970) listed the lhetorical functions for an experimental 
report as the Statement of the Problem, the Hypothesis, the Deduction of the 
Consequences, Description of Procedures, and so m. However, these rhetorical 
functions are very seldom explicitly identified in the text of any report, so that 
identifying them as such appears to be £ post hoc operation. We wondered if 
there might not be some way to identify these rhetorical functions by the ex- 
plicit discourse functions employed, in addition to the content which might 
indicate those functions. We wondered if the use of causality, contrast, time 
or space order, and so on, could be correlated with specific rhetorical functions 
such as the Statemetit of the Problem, or the Hypothesis (see Figure 1). 

Iu (*der to test our hypothesis, we carefully analyzed the introductions to - 
several experimental reports. Because we \yere looking for functional, not formal, 
properties of this discourse, our analysis had to proceed along functional lines. 
Discourse Bloc analysis, described below, provides a convenient framework in 
which to describe both the explicit discourse functions of EST and their implicit 



rhetorical function counterparts. 



) See Seark (1975) for an excellent presentation of indirect speech acts. 
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Figure 1. 



Oral Communication: 
, Communicative Function 



=■ Sentence Type 



REQUEST FOR X 
'Written Communication: 
Rhetorical Function 



"Do you have X?" 



\ 



= Discourse Type 



PROBLEM 
HYPOTHESIS 



PP 

p? (Contrast, causality, time 
?? and space order, etc.) 



DEDUCTION OF CONSEQUENCES 
Jfc Discount Bloct 

Traditional treatments of discourse have been deficient because they are 
primarily descriptions of the formal properties of sentences, and have often, 
failed to account for the functions of those sentences. Pitkin (1969, J973) 
recognized the limitations of conventional analysis And proposed that the de- 
fining characteristic of discourse units be a functional one rather than a formal 
one. 

3.1. % Tbe Functional Units of Discourse (Defined). Formal units erf discourse — • 
the clause, the sentence, the paragraph, and so on — tend to be inadequate 'de- 
scriptions of the basic functions of discourse, ^ven when they are reinterpreted . 
a!ong more functional lines, as in Longacre (1970, 1976). The following dis- 
c mrst shows this. / 

(5) a, John is a good skier. He doesn't have time to compete profession- * 
ally, however. 

b. John is a good skier; however, he doesn't have time to compete 
professionally. 

c. John is a good skier, but he doesn't have time to compete pro- 
fessionally. £ 

d. Although he is a good skier, John doesn't have time to compete 
professionally. 

e. A good skier, John still doesn't have tim4 to compete professionally. 

Although (5a) and (5b) might be considered stylistic variants, (5c) quite 
clearly could not be equated with them on the basis of its formal properties. 
However, it is equally clear that the relationship between the two parts of 
each version of (5) is constant; after a concession is allowed (John's ability to 
ski well), a contrasting assertion is made (Johns' lack of time). Note that by 
defining one unit as a concession and the other as a (contrasting) assertion, it/ 
is possible to portray the functional relationship between the two units of (5a) 
in the same way as for (5e). further, these two units can be said to appear 
in a contrastive relationship solely by virtue of the fact that they are used to- 
gether. Concession assertion requires two units of discourse, and it requires 
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them to appear as parts of a single relationship which can be defined by the 
functions of .those two units. This allows us to ignore, temporarily, the gram- 
matical form of such sentences, while we concentrate on their functions. 

There are a number of such functional relationships in discourse. The fol- 
lowing is a brief list of those that appear most frequently. 1 ' 



COORDINATE COORDINATE: 
INCLUDER INCLUDED: 

CONTRAST CONTRAST: 
CAUSE .EFFECT 

r 

ASSERTION RK ASSERTION: 
FOCUS ASSERTION: 



A concatenation of two or more inde- 
pendent units, either additive X/and Y, 
or alternative X/or Y. 3 

These are relationships in which a more 
inclusive unit is paired with one which 
is more specific. There are four ways in 
which the shift can be made — X/ namely 
YrX/for example Y, X /including Y, and 
X/ especially Y. Further, there are four 
modes of analysis — operation stages or 
phases, genus/ species or species /sped- , 
men, complex/ components, and whole/ 
pari. 

There are two main types of contrast — 
concession assertion (true X, yet Y), and 
negative /positive (not X/ rather Y). { 

Although causality, is too complicated 
to treat adequately here, we can state 
that, as is the case with other functional 
relationships, cause and effect units ap- 
pear in discourse. Four broad categories 
are the conditional (if X/Y), br uncon- 
ditional (because X. Y), and the mental, 
or logical, cause and the physical cause. 

A restatement of an assertion appears as 
X, that is Y. 

A presentential modifier may be used to 
focus on or to identify the time, place, 
or source of an assertion. 



J See also Pitkin |977a. 1077b, or Hepvsorth 1978. 

1 The X/Y formulas are a shorthand for the functional relationships. X is the first unit 
in a discount' Y the second The t>pe 0! relationship is indicated h> the use of a signal ftUe h 
as "and" or "or" 

•Since this paper u.is originally written, I have received by private communication 
Borkin's (1978) paper on dissonant contrast. It seems to support the general claims heing 
made here. However, it seems best to delay full incorporation for the present. 

o 
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ASSEHTION/INTENSiFICATION 

ASSERTION/REGRESSION: An assertion can be made stronger, X/ 

indeed Y, or weaker, XI at least Y. 

COMPARISON/COMPARISON: Comparisorls, or similes and metaphors, 

are represented as just as X/so Y. 

Although this list is not complete, it captures the functional basis of the 
binary discourse units and indicates the broad categories of relationships which t t 
appear in a)\ discourse. * 

3.2. Hierarchical Ordering. In Discourse Bloc analysis, all. discourse is con- 
sidered to be hierarchically ordered into increasingly more complex blocs. This 
ordering appears in (6). 

(8) She fixed dinner— making salad, broiling a steak, and cutting and 
toasting garlic bread — and called her husband. 

In this operation (preparing for dinner), the two main stages are fixing 
dinner and calling her husband. Fixing dinner requires three sub-stages to com- 
plete: fixing salad, broiling steak, and fixing garlic bread. And fixing garlic bread 
in turn requires two sub-sub-stages to complete: cubing (garlic bread) and 
toasting garlic bread. The functional hierarchical structure of (6) is reflected 
in the restraints on formal arrangement of the units. In longer blocs of discourse 
the same hierarchical ordering of units occurs. As we noted above, these units 
may be of any grammatical form, so that a single noun phrase, for example, i# 
may be used in contrast to several sentences. This property of discourse makes 
it necessary to redefine its basic units in functional terms A discourse bloc is a 
functional unit consisting of a main binary relationship between two sub-units; 
each of those sub-units may be composed of ' irther binary sub-parts. Hence, 
, the absolute size of any discourse bloc is limited to the length of discourse one 
chooses to consider. 

3.3. Diagramming. A method of diagramming which reflects the functional and 
hierarchical structuring of discourse is an essential part of the analysts. TTtese 
diagrams depict the horizontal and vertical relationships which hold for a given 
discourse bloc. Figure 2 is the Discourse Bloc Diagram of (6). 

The horizontal lines reflect the relationship at each level; the vertical lines 
topped tt>y a bloc signal such as and or or indicate which units are involved in 
the functional relationship at that level and what the relationship is. For ex- 
ample, the two main stages of (7) are coordinate (X/and Y). The first main 
stage is an operation and three of its sub-stages, so the next level down in the 
diagram states that there is an operation and stages (operation/including stages). 
And the third sub-stage thus identified is furthei broken down into its own 
two sub- sub-stages (X/and Y). 

In this rather brief treatment of the Discourse Bloc method of analysis, we 
have tried to indicate its three main features: 1) the recognition of explicit 
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discourse functions in all discourse; 2) the hierarchical ordering of all discourse; 
and 3) Bloc Diagram functions. 

4 The Analysis of EST 

When the introductory paragraphs ) of experimental reports are examined 
to determine their rhetorical structures, certain similarities appear. As indicated 
above, Lackstrom, Selinker and Trimble (1970) state that the Introduction to 
an experimental report will contain: 1) the Statement of the Problem (PROB); 
2) the Hypothesized Solution (HYP); and 3) the Deduction of the Conse- 
quences (D of C). It is no surprise that these, too, are functionally defined units. 

These rhetorical functions do not appear in the inventory of discourse func- 
tions found in general use precisely because the EST functions are defined in 
terms of the specific area in which they are used— namely, the introduction to 
an experimental report. While the rhetorically-competent discourse analyst and 
the rhetorically-corrfpetent scientist may be able to recognize these features, 
and perhaps even to use them in writing effectively, it is not at all clear that 
the same will be true for many ESL/EST students. It is, in fact, quite likely 
that the ESL/EST student will need considerable practice in learning to use 
these rhetorical features in writing. It would, therefore, be quite valuable to 
have a more precise description of the ways in which Problems, etc., are written. 
Thus, based on the hypothesis suggested above— that there will be a correlation 
between the explicit discourse functions such as causality and the implicit 
rhetorical functions or EST such as causality and the implicit rhetorical func- 
tions of EST such as PROBLEM— analysis* of the introductions from several 
experimental reports was carried out following the discourse bloc model. As 
predicted, there are some highly significant similarities, as well as some im- 
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pbrtant differences. Figures 3 and 4 depict the two basic versions of what we 
will come to call the Idealized Introduction to an Experimental Report. 

Figure 3. 



cause 


effect 


true 
X 


yet 
Y 











Figure 4. 



cause 


effect 


true yet 


and 




X 
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if 
X 
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In these Bloc Diagrams, only those discourse relationships which play a 
significant and insistent role in the development of the introduction are shown. 
Abo, since Figures 3 and 4 are syntheses of many such diagrams, it is highly / 
unlikely that one will encounter any single introduction exactly like either, / 
although, of course, it will be very similar to one or the other. Note that in most 
particulars, Figures 3 and 4 are identical, except that Figure 4 has a more 
elaborated second stage. 

After we found the Bloc Diagrams of Figures 3 and 4, we related the 
explicit discourse functions that appear in Figures 3 and 4 to the rhetorical 
functions PROB, HYP, and D of C. Problem, being a single rhetorical function, 
had to be redefined for this purpose. Discourse relationships are binary, which 
means that the Problem statement must be further analyzed either into two 
constituent pr.rts or paired with a second related unit (either horizontally or 
vertically expanded in the Bloc Diagram). Traditional Discourse Bloc analysis 
dfc^not provide for this description of the problem, although there is some 
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discussion (Pitkin 1977b) of the need to consider it. On the other hand, tradi- 
tional EST discourse analysis does provide terms that appear to fit into the 
needed pattern — Problem and Solution. Looking at Figure 3, we intuitively 
feel that cause/effect fits nicely with what we might call problem/ solution, 
provided that the nature of problems in science are carefully considered 

Normally, cause/effect would mean that either a physical causality or a 
mental deduction exists, as in (7) and (8). 

(7) Because "X" exists, "Y M also exists. * 

(8) Because a X" exists, we must assume that "Y" exists. 

When we are talking of scientific investigation, it is not sufficient to state 
simply that a solution 44 Y W exists, nor that we assume it must exist. Rather, it is 
assumed that several possible solutions might be proposed for any given prob- 
lem, each of which is subject to validation. Problem/ solution in science, then, 
would indicate that for the given problem "X", "Y" is the most appropriate of 
all of the possible solutions, and further, that this can be demonstrated by the 
investigator's experiment. Problem solution means something more like (9). 

(9) Because we have found the problem *X", we proposed that the solu- 
tion * Y M be tested in the manner described in this report. 

Problem/ solution, then, appears to be a specialized iorm of the cause/effect 
relationship which serves a function unique to the type' of discourse in which 
it is used, that is, the introduction to an experiment report. 

Turning to the other discourse relationship which Figure 3 and Figure 4 
have in common, we find the concession/assertion, or true X/yet Y relationship. 
It will be easier to discuss this particular part of the Diagram if we have 
reference to an actual section of discourse in which it appears. Consider, there- 
fore, Figure 5. 

Figure 5. 

There have l>een numerous recent reports of the immunologic, biochemical 
and histwhemical properties of the neoplastic cells of hairy cell leukemia. 
Little attention has been devoted, HOWEVER, to the stromal changes in in- 
volved organs in this disease. (Nanba, i t al., 1977) 

Figure 5 indicates that certain information has been reported and that 
other information has not been reported in a certain area of pathology. This 
is sometimes called a review of the literature, or a review of previous research. 
We are concerned with this content, but we are also concerned with the use 
of the true X/yet Y relationship. As Figure 6 indicates, X and Y could have 
been employed. 

Figure 6. 

There hav> been numerous iccent reports of the immunologic, biochemical 
and histochemical properties of the neoplastic cells of hairy cell leukemia. 
AND little attention has been devoted to the stromal changes in involved 
organs in this disease. 
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Figure L, however, strikes us as rather unnatural. Contrast, rather than simple 
coordination, expresses in an explicit way the nature of problems, in the context 
of experimental reports. In a scientific sense, a problem is a situation in which 
some sort of conflict exists. As in Figure 5, the conflict may be between avail- 
ability of information in one area and the lack of information in another. In 
-other cases, the conflict may be a "dissonance" (cf. Borkin 1978) between two 
competing theories. The use of the explicit contrast /contrast relationship, then, 
is to be understood as the expression of some sort of conflict within the area of 
interest. That conflict is the problem. Based on Ihe content of reports like 
Figure 5, the terms previous research and data lacking have been developed 
as descriptions for the two units of discourse involved. Note that using the 
expression previous research/ data lacking also means true X/yet Y will be 
used explicitly. 

Let us turn now to the effect unit of Figure 3. Figure 3 illustrates one of 
the more common forms of development for the introduction, in which the 
effect, or solution, is a single unit. Consider, for example, Figure 7. 

Figure 7. 

The purpose of this pilot study is to examine the educational environment 
of the bilingually-schooled child . . . (Bruck and Schultz 1977). 

As Figure 7 indicates, this type of introduction contains a statement of the 
purpose of the experiment. There is no specific hypothesis stated (contrary 
to the predictions of Lackstrom, et al. ), but there is the suppressed prediction 
that Certain kinds of information will be gained. \ln Figure 7, the information 
is the educational environment of two children. This type of solution unit has 
been called PURPOSE to indicate that it is a statement of the purpose of the 
experiment, although not necessarily an explanation of the hypothesis it is 
designed to provide evidence for. Although it may be further elaborated, as 
was the original from which Figure 7 is excerpted, it has a single rhetorical 
function, one common to all such introductions. £ 

Figure 4 illustrates the other sort of solution unit that may appear. Consider 
Figure 8. 

Figure 8. 

With respect to the motor innervation, the question arises whether the alpha 
and gama efferent fibres would regenerate at different times . . . it was sup- 
posed that the failure of regeneration of an intact g.imma system was re- 
sponsible for the lack of the tetanus symptomology in these, animals. 
IF this interpretation is correct, th'» marked differential of the motor fibres 
would provide an interesting new means of obtaining a preparation in which 
the alpha innervation is stiii absent. The intent of the present paper is to 
provide more directly supporting evidence for this. (K'anha ct al., 1977) 

The first sentence is, in fact, part of the Problem unit, being a summary of 
the first several paragraphs in which the Problem is developed, as we have 
suggested above. Then, following a bri< " stretch of discourse in which reasons 
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for a particular solution are cited, the solution is presented, 4 it was supposed 
that . . " The hypothesis is, of course, that there is a difference in the rate of 
regeneration. 

The second paragraph is what we have referred to as the D of C. It con- 
sists of a conditional cause/effect, if X/Y. If the hypothesis is true, then, the 
result will be us indicated, 4 an interesting new means of obtaining a preparation 
in which the alpha innervation is still absent". Finally, the now-familiar Purpose 
statement appears. 

Based on the evidence of introductions of the sort depicted in Figures 5, 
7, and 8, we have developed an idealized version of the Introdnctjpn to an 
Experimental Report. It contains each of the discourse functions noted above, 
and employs a simple convention to show where the rhetorical functions cor- 
relate. Consider Figure 9. * ! 



Figure 9. 
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CURRENT THEORY 



CONFLICTING DATA 



SOLUTION (PURPOSE) 
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I RESbLTSOF" 
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The rhetorical functions of EST discourse have been entered into the Bloc 
Diagram in the positions corresponding to the explicit discourse functions which 
correlate with them. For example, the PROBLEM has replaced the cause, while 
SOLUTION has replaced effect at the main level of this Diagram. In order to 
establish clearly that these are implicit rhetorical functions of EST, rather 
than the explicit discourse functions, they have heen printed in upper case 
letters. Nate that PROBLEM SOLUTION is a specific type of cause/effect 
relationship; the use of the term PREVIOUS RESEARCH DATA LACKING 
means that there is a true X yet Y relationship involved, arid so on. Although 
we have not yet dealt with the terms CURRENT THEORY CONFLICTING 
DATA, they have also been placed in Figure 9 sin^e many PROBLEMS are 
developed through this procedure. 

Tims far, our use of examples has been limited to isolated sections. To 
conclude, let us look quickly at one complete section of discourse comprising the 
introduction to a report published in a recent TESOL Quarterly in Figure 10. 

Figure 10. 

A Final Sample Introduction j 

(IA1) Much teaching material has become .ivaihihle recently in the field of 
English for Science and Technology for the non-native student. (IA2) VET 
most of this material is geared primarily toward the teaching of reading 
comprehension (IB la) Our experience with foreign students at the Masters 
level has led us to believe that the greatest strength of many EFL students 
K in reading tcchiuc.il English, (IBlb) particularly in their field of special- 
ization, (IB2*i) WHEREAS their writing skills are seldom as well developed. 
(IB2h) The ability to write acceptable- scientific English becomes a major 
concern for those students who must successfully complete a thesis as part 
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of their degree requirements. (HA) THUS we have developed a writing pro- 
gram based on recent research in EST in an effort to help our students pre- 
pare 'or the final stage m their degree programs. The purposes of this paper, 
THEN, are. (IIB1 ) l) to identify that part • . the literature we found most 
applicable to classroom use; (IIB2) 2) to indicate the steps we took to trans- 
late research findings into course syllabus; and (HB3) 3) to describe the re- 
sulting teaching materials (Weisberg and Buker 1978) 

Figure It. 
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T n Figure 10, the discourse is developed along the lines we have suggested 
in the idealized T ntroduction, Figure 9. Note that the overall relationship is 
PROBLEM/SOLUTION. The break between these two stages comes at the 
unit marked (HA), which begins with the bloc signal THUS. The PROBLEM 
is developed as a contrast between PREVIOUS RESEARCH, units marked (IA), 
and CONFLICTING DATA, marked (IB). The previous work referred to is 
the abundance of F.ST material designed for reading comprehension. The con- 
flicting data is that, in the authors' experience, the students need more writing 
material and less reading material. As a result of tlis situation, the solution 
(a writing program) was developed Finally, the SOLUTION " unit states 
precisely the three purposes of the paper, which are, basically, to describe that 
writing program. 

Wn might comment that it is unusual to find an introduction so clearly 
and concisely developed as this one. One might postulate, therefore, that the 
authors have been influenced in a positive way by their n ork with EST dis- 
course. 

S. Conclusions 

In t 1 s paper I have tried to indicate how the explicit discourse functions, 
common to any written discourse, can be used with specialized functions within 
a limited universe of discourse such as thai r EST, and how these functions 
must be interpreted, therefore, as implicit in the use of sucn discourse functions. 

Specifically, we found that the introduction to all experimental reports will 
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employ certain standard discourse ft notions— cause/effect, contrast/ contrast, 
and, possibly, conditional cause/effect. Each of these ^understood to have an 
implicit rhetorical meaning: PROBLEM/ SOLUTION for cause/effect, contrast 
between PREVIOUS RESEARCH and LACKING DATA for contrast/ contrast , 
and the DEDUCTION OF THE CONSEQUENCES OF /HE HYPOTHESIS 
for conditional cause/effect. Although the introduction is not limited to just 
these functions, and although they may appear in widely divergent forms; we 
can predict that they will appear and that the development of the Introduction 
will depend on therri. 

There is a rhetorical, competence involved in tuc proper interpretation of 
these implicit rhetorical functions. This rhetorical competence can and should 
be part of -my advanced EST reading or writing course for ESL students. 
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Test Bias in Language Placement Examinations 

HetMin Forhody 

University of California, los Angtlts 

This study investigated possible unfair decisions made on the basis of the re- 
sults of placement examinations whieh consist of a variety of subtests. In 1WU^ 
Carroll argued against generally accepted discrete-point tests and recommended 
tests of integrative skills. Since then, there has been an ongoing debate among 
scholars, some of whom ativocate discrete-point and others who prefer integra- 
tive tests However, a question which has not been emoirically answered is 
whether students from different linguistic, educational, and cultural back- 
grounds are disadvantaged in their performance on either discrete-point or in- 
tegrative tests. , , . 

To address the issue, the performance of foreign students on these two 
types of tests was investigated. The subjects were 350 incoming foreign stu- 
dents representing eight differed countries. The instrument used was the place- 
ment examination at UCLA'which consisted of eleven subtests 

The results of this study strongly support the hypothesis that some toreign 
student grouos perform better on discrete-point subtests and others on integra- 
tive subtests. Finally, the effects of these differences on placing foreign students 
,0 appropriate ESL courses' arc discussed and three alternatives are s uggested 
to avoid disadvantaging any group of students. 
The great number of foreign students attending American universities has 
intensified the problem of placing these* students in appropriate ESL courses. 
In most cases, placement examinations consist of a variety of subtests whose 
purpose is to assess student competencies in various language skdls (listening, 
reading, writing, and speaking) and in various linguistic areas such as phonol- 
ogy, morphology, and syntax. Oft< , incoming foreign students are no equa ly 
competent in all language skills, due to their educational, cultural and linguistic 
'backgrounds. This diversity may influence their performance on language pro- 
ficiency tests used for placement purposes. This paper attempts to demonstrate 
the effect,' of different .ypes of tests in assir-ng foreign students to vanous 
language courses as well as possible unfair decisions made on the basis of test 
scores. 

1. Background 

, Teaching and testing are sometimes appropriately referred to as a single 
process because modifications on the former have influenced the latter. It was 
not until the rise in prominance of the audio-iingual -pedagogical approach that 
the ESL teaching strategics, and consequently testing procedures, underwent 
fundamental changes. Thereafter, the theory of language testing developed by 
Lado ( 1961) and others resulted in the so-called discrete-point testing approach. 
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The advocates of this approach believe that, sinpe language is comprised of 
sounds, intonation patterns, morphemes, and the meaningful arrangement of 
words, testing these elements would enable teachers to effectively assess their 
students* abilities in various language skills. «• 

Later, when the audio-lingual method was challenged by proponents of 
other approaches, the testing procedures developed under the influence of the 
audiolingual method were also subject to question. Carroll (1981} was the first 
to trigger the reform by stating, *\ . . I recommend tests in which there is less 
attention paid to specific structure points or lexicon than to the total com- 
municative effect of an utterance" (p. 318). Carroll referred to this type of 
testing as th** integrative approach. Scholars are still debating the choices, some 
. advocating discrete-point tests, others tests of integrative skills, 

Mudi has been written on the adequacies and shortcomings of both dis- 
crete-point and integrative tests (Oiler 1973a, 1974, 1976b; Briere 1970, 1972; 
Rand 1972; Jakobovits 1970;- Spolsky 1973; Hinofotis 1976). However, an un- 
investigated issue is that students from different linguistic and educational back- 
grounds may perform differently on discrete : point and integrative tests. 

The purpose of this paper, then, is to investigate: 1) potential differences 
between the performance of foreign students on discrete-point and integrative 
tests, 2) the effect of the sex variable on the subjects' performance on discrete- 
point and integrative tests, 3) possible unfair decisions made due to biased test 
scores, and' 4) the effect of selecting one test over the other for student place- 
ment in ESL courses. The final section of the paper is devoted to suggesting 
alternatives for overcoming some of the problems with existing placement 
procedures when placement is based on the composite score of several subtests. 

Various Subtests of the ESLPE 

Discrete- Point Integrative 

Spelling 22 items Reading 23 items 

Punctuation 22 items ' Listening 30 items 

Structure 62 items Dictation 50 items 

Vocabulary 45 items Cloze 50 items 

2. Method 

2.1 Subjects. The subjects in this study were 491 inqonifng foreign students 
who took the Fall 77 version of the UCLA English as a Second Language Place- 
ment Examination (ES1.PE). They represented eight -linguistically and educa- 
tionally different -groi ps. Table 1 represents the number of subjects from each 
country. 

Previous research on the ESLPE (Sanneii, 1977) demonstrated that factors 
such as major field of study and academic status did not have any significant 
effect on the performance of foreign students. Consequently, the subjects in 
this study were not categorized on the basis of the above mentioned variables. 

2.2 Instrumentation and Procedures. Since 1956, the University of California, 
Los Angeles has l>een administering different versions of the ESLPE to all in- 
coming foreign students whose native language is not English. Research on 
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TABLE I 



The number of students from different countries taking the Fall-77 ESLPE 



Country 



Number of Students 



Inn 
Taiwan 
Korea 
Japan 

Hong Kong 

France 

Mexico 

Israel 

Others 



56 
51 
53 
44 

27 
27 
25 
22 
186 
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various forms of the ESLPE indicates that the composite score of the subtests 
tends to accurately measure the students' overall language proficiency (Connolly 
1972). The Fall 77 version of the ESLPE consisted of eleven subtests, evenly 
divided into two subsections. The discrete-point subsection consisted of spelling 
(22 items), punctuation (22 items), structure including articles, prepositions, 
and verb forms (62 items), and vocabulary both Romance and Germanic (45 
items). The integrative subsection included reading comprehension (23 items), 
listening comprehension both visual and written (30 items), dictation (50 
items), and cloze (50 items). The subjects taking the test were allowed three 
and a half hours to complete the test and a bio-data sheet. Hie ESLPE scoring 
system allowed one point for each correct response. The students were not 
penalized for guessing, and blanks were counted as wrong answers. The cloze 
test was scored by the exact-word scoring procedure. 

It should be mentioned that classifying a test as either discrete-point or 
integrative seems to be difficult and sometimes inappropriate. Most testing 
specialists, including Oiler, believe that the difference between discrete-point 
and integrative tests is a matter of degree rather than type. We may consider 
the two along a continuum ranging from highly discrete-point items at the one 
end to highly integrative items at the other (Oiler 1973b). Nonetheless, discrete- 
point tests generally aim at testing one and only one point of language at a time. 
Integrative tests, on the other hand, are intended to assess language proficiency 
in terms of holistic performance, sometines akin to actual functions of the 
language in real life situations. However, due to their nature, the subtests of 
the ESLPE were easily categorized as belonging to either the discrete-point end 



2Ji Data Analysis and Results. Subtest scores were standardized to achieve 
equal weights before adding the total score. Then two separate analyses were 
conducted. > 

Analysis 1: The standardized scores of. the 305 subjects wer. analyzed 
utilizing a two way analysis of variance (ANOVA). The results of the ANOVA,- 
presented in table 2, supported the hypothesis that iomc foreign student groups, 
depending on their educational and linguistic backgrounds, performed better 




0 



ERIC 



170 



4 



Test Bias 165 



TABLE 2 

The icores of eight language group* on DP and IN subsections* 



- i - 1 

Country 


DP 


-_ ~r.--.~z: - 
- - _ J 


— '- 
IN 






Mean 


SD 


Mean 






Hong* Kong 
, Israel 


218 


27.15 


221 




24.84 


201 


28.70 


* 218 




29.98 


France 


204 


32.59 


213 




34.62 


Inn 


188 


35 J2 


198 




34.34 


Mexico 


199 


35.55 


199 




39.10 


Taiwan 


209 


38.34 


200 




27.70 


Korea 


196 


33.68 


187 




31.52 


Japan 


192 


35.37 


184 




33.80 


* DP. Discrate-point ^ 



IN. IntemtWe 



on the discrete-point subsection and others performed better on the inu %tive 
subsection. 

It is ,clear that, except for Mexican students, who scored equally well on 
both types of tests, there were significant differences among students from 
various countries. Students from Iran, France, Israel, and Hong Kong scored 
higher on the integrative subsection. Students from Taiwan, Korea, and Japan , 
scored higher on the discrete-point subsection. The results of this analysis also 
irdicated that students from Hong Kong and Japan obtained the highest and 
the lowest total scores on the ESLPE, respectively. Finally and more inter- 
estingly, the difference between the scores on the discrete-point and integrative 
subsections was greater for the students from Taiwan and Israel than for stu- 
dents from other countries. However, the difference was in the opposite direc- 
tion, i.e., students, from Taiwan scored significantly higher on the discrete-sub- 
section, whereas Israeli students scored higher on the integrative subsection. 
Figure 1 illustrates the relative standing of the students from each country. 

Analysis 2: In this analysis, the scores of male and female students (N= 
491 ) on the discrete-point and integrative subsections of the same ESLPE were 
compared. The data were analyzed using a regression model of discrete-point 
subsection (¥) on the integrative subsection (X,), sex (X.), and the interaction 
between the two (X, and X.). The results of this analysis, which are reported 
in table 3, indicated that eighty percent of the variation in the discrete-point 

TABLE 3 
Regression Statistics on DP and IN 



Variable MR R 3 h SEE F Mean SD^ 

Integrative .89 ~80 .85 11.41 ~ 1990 7 * 91.52 25.68 

Sex .89 80 9.97 11.38 7" .67 .47 

Sex X Integrat .89 .80 -.10 II A3 6** 61.59 49.58 

Constant = 23.40 



•• P < .01 

subsection was accounted for by the integrative subsection. The contribution 
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Figure 2. Regression Lines for Males and Females. 



Female 




\ nothing to our information when included in the model simultaneously. 



He negative value of the regression coefficient for the interaction effect 
indicated that the two regression lines intersect at X=110.70 and Y=117.57. 
Thus, below this point, males outperformed females on the discrete-point sub- 
section, whereas above the point of intersection, females sc red higher than 
males. This interaction effect is illustrated in figure 2. 

To establish regies of significance, the formula developed by Johnson and 
Neyman (1936) was utilized, with the lunge of significance calculated to be 
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between X= 136.20 and X—94.80. This implied that, within the range of these 
values, the scores on the discrete-point subsection were not significantly different 
across the groups. Nonetheless, students who scored beyond and below these 
values on the integrative subsection scored significantly different on the discrete* 
point subsection. Since the possible attainable scores on the discrete-point and 
integrative subsections were well below and beyond the range of significance 
(151), it seemed important to consider sex as a potential variable which could 
influence the performance of foreign students on placement examinations. Since 
the resrlts of this analysis did not confirm the findings of previous research men- 
tioned above (Sanneh 1977), until further investigation, these conclusions should 
Ik? considered speculative rather than definitive. 

3. Discussion 

The notions of discrete-point an f a integrative tests deserve serious attention 
in the development of second language proficiency tests. In this study, .students 
from different sexes as well as different educational backgrounds were found to 
perform better or worse on one or the other type of test. No matter how reliable, 
valid, and practical a given test may be, there will be some undesirable biases 
due to variables which are not directly relevant to the underlying constructs 
of the tests. < 

The UCLA ESLPE, a statistically well established instrument, having a 
high internal consistency (.97) and being validated ugainst several criteria, 
could be considered a good placement test. However, depending on the number 
and/or weights of the subtests, a student from a specific educational background 
could be placed in either a higher or lower level course tfmn is justified. An 
example may help clarify the pouu. Suppose that there are two types of tests 
(test A which is discrete-point and test B which is integrative^, two groups of 
students (group 1 from Taiwan and group 2 from Israel, which both have 
similar proficiency levels), and three levels of ESL courses (elementary, inter- 
mediate, and advanced). If the two groups took test A. the student^ from 
Taiwan would perform better than the students from Israel. Thus, by obtaining 
high total scores, they would be placed in the advanced course; the Israeli 
students would be placed in the elementary course because they would not be 
able to perform to their fullest potential on a discrete-punt test (see Figure 3). 
On the other hand, if test B was administered to the same groups, the reverse 
would occur (see Figure 4). 

There is still a third possibility wherein the two tests are combined and given 
to the same groups. In this case, both groups would obtain fairly equal total 
scores and be placed in the same level (see Figure 5). 

Such hypothetical placement may seem exaggerated, but it could occur, !n 
short, regardless of the abilities of the examinees, some will be hvored or penal- 
ized because of the nature of test being administered. Keeping the above dis- 
cussion in mind, the following points seem important. 

1. As long as the controversy between discrete-point and integrative tests 
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is not resolved empirically and definitively, attempts should be made to keep a 
balance between the two in language proficiency tests. Since the underlying 
constructs of overall language proficiency have not been identified, it seems 
advisable not to assess the language proficiency of the students with only one 
specific type of test. 

2. Since ESL students in academic environments need different language 
skills than their non-academic counterparts, it seems important to distinguish 
the extent to which different language skills (survival and academic) are re- 
quired of university students in competition with native speakers. Of course, 
students are required to use both types of language skills, but academic English 
is more important to them than to a tourist or a person who intends to live 
in an English speaking community. Students are required to listen to lectures, 
read textbooks, participate in class discussion, be familiar with spelling and 
mechanics of writing, and produce acceptable written and oral forms of lan- 
guage; such a demand is not required of a non-student foreigner. Therefore, 
university proficiency tests, especially placement examinations, should be more 
thorough than other types of tests by including academically important as well 
as survival language skills. 

3. In the case of course placement, as mentioned before, the procedure for 
using the total scores as a basis for placing students in different language courses 
has flaws. The main limitation of this procedure may be resulting differences 
in the linguistic and communicative abilities of the students placed at the same 
level of instruction. A student who is proficient in grammar is not necessarily 

** competent in communicative skills. Also, a student who feels comfortable in 
communicating with native speakers is not necessarily highly skilled in utilizing 
discrete linguistic elements of the language. Therefore, instead of placing stu- 
dents on the basis of their total scores, they may be more apfiopriately placed 
according to their individual language abilities, which may be reflected in their 
performance on the discrete-point and integrative tests. As a result of such a 
placement procedure, instead of having sequential courses, skill-focused classes 
might be more appropriate. In this case, a student who was quite competent 
in one skill would not be required to take the course* which emphasizes that skill. 
Instead, he would take courses where the emphasis was on the skills necessary 
to better fulfill his language needs. 

It is true that language skills are highly interrelated, but still there seems 
to be an imbalance in the abilities of foreign students in different skills. This 
may be due in part to the absolute difficulties of the skills as well as to the 
previous education of the students in which certain skills received more em- 
phasis than others. 

In summary, the conclusions drawn from the results of this investigation 
and their implications are important for placement purposes, especially in the 
United States. The suggestions will hopefully encourage officials and 'test de- 
velopers to pay close attention to the type of the test as well as the language 
and educational backgrounds of the examinees in order to avoid unfair decisions 
on the basis of test results. 
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Cultural Bias in Reading Comprehension Tests 

9 Bernard Mohan 

University of British Columbia 

Some test items in reading comprehension tests test both language knowledge 
and cultural knowledge. Various standardised tests contain them and thereby 
, discriminate against ESL students. This paper demonstrates how test items for 
commonly used reading comprehension tests are culturally biased, indicates the 
degree and location of bias in a number of popular tests, and outlines a pro- 
cedure for detecting bias that teachers can use. The view is taken that evalu- 
ating these test items is an application of linguistic semantics, which means 
that judging their content validity can be put on a solid theoretical basis. 

Reading comprehension is one of the various activities that ESL students 
perform where differences of culture can lead to differences of interpretation. 
Li test situations such differences of interpretation become serious difficulties. 
This paper will examine the problem that the culturally different pose for read- 
ing comprehension tests. Three questions are of interest: 

a) Is there cultural bias in reading comprehension tests, and if so, does 
it matter that there is bias (e.g., does the bias amount to discrimina- 
tion)? 

b) How can teachers and students, the consumers of tests, identify bias? 

c) Is cultural bias in reading comprehension tests inevitable in principle, 
or are there systematic ways in which it can be avoided or at least 
minimized? 

These questions will be approached through the analysis of question items 
from test . However the analysis will be linguistic rather/than statistical and 
a particular aim will be to show bow teachers can do s\ich an analysis for 
themselves. * + 

Consider the ioJlo^iiyj/test items where a student will have to rely on 
specific socio-cujuiral knowledge to get the right answer. The items come from 
widely used tests of reading comprehension: the Stanford Diagnostic Reading 
Test, the Canadian Test of Basic Skills (a revision of the Iowa Test of Basic 
Skills for Canadian use), the Gates McGim'tie Reading Test, the Nelson-Denny 
Reading Test and the Capadol Reading Comprehension Test. They have been 
chosen to show a range of cultural knowledge and have been altered non- 
significantly for reasons of copyright and brevity* ( In each) case the form of 
the test and the item number is given so that the original can be identified). 

1 ) Patriotic objects, 'There are red & white stripes and white stars in our flag* Our flag 

contains one . - for evei> Jate/ 

a) stripe b) star (S.D.R.T. Form X. Level 1. #32) 
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2) Food In the story the French reflated potatoes like most OMclian* regarjl ' 

a) spinach b) tomatoes c) horsemeat c!) margarine 

(C.T.B.S. Form 1, #89) (Note, in the story the French dislike potatoes.) 

3) Custom* 'in thLs poem, what does "April Fool!" mean?' 

(CT.BS Form 1, #98) 
a) T^eVrson who said it was fooling h) It .vas not April at all 

4) GamexyJSam won at marbles localise he could straighter than Hill ' 

aMhow I)) shoot c) draw d) run 

(C. McC D Form 3M. #1) 

5) History 'In the first colonies in America, making clothing took time. The women first 

had to spin the yarn Clothes for the colonial family jpere "Mially »»ade ln 

a) factories b) homes c) luxury d) China 

((; McOC Form K. #W) 

«) Geography The Yankee peddler traded as far west .is the Mississippi and as far south 
a\ Louisiana. He operated 
a) over most of the country *») as far south ns Louisiana 

(N.D. Form D #10) 

7) Folklore Tam went to the party with a tall pointed black hat, long black cape and a 

broom She was dressed a| .i . . * - 

witch b) ghost c) cowgirl d) pumpkin 

(C McG.C Form 1 #4) 

8) Housing 'Bill ran out on his front porch to watch the firetruck. He lives m 

a) a big apaitment b) a city house c) .1 tr\iW 

(G T.B.S Form 4. Level 9 # 1 ) 

9) Culture-hound metaphors 'What does "applying soft soaj* mean in paragraph 6?' 

v a) looking clean in public !>) flattering people , (CTBS Form L #131) 

tO) Stories familiar to particular cultural groups. (Clo/e test example^ 

The person Vas holding tight to the handle of an open , dangling by one 

hand like a doll tied to the string of a bnlloon ' " "~ 

((;. Form ^ #2) 

In order to mswer these one needs io know about ( successively ) : the 
symbolism of the U S. flag; Canadian food preferences, j\pril 1st customs; how 
marbles is played, early American history, American geography; the garb of the 
typical witch, the layout different types of North American .homes an English 
idiom; and the story Of Mary Poppins. It is clear that special cultural knowledge 
is required, and therefore that there is indeed cultural bias in reading compre - 
hension texts. 

The procedure nsejl to identify biased items can be summarized in two 
questions. # 

11) Does the item test language knowledge, or does it te knowledge of the world too? 

In (12) it might seem that knowledge of the world concerning witches and 
Halloween is needed to answer the item. Yet on closer inspection it becomes 

12) The playmates wore costumes to Sandra's Halloween party Nellie ^ A tall, 
blaik. pointed hat* 



a) walked b) wore i) cared d) hurt e) looked j 

(C MeG. I) Formf.iM. #7) 




clear that the* item can be answered on the basis of language knowledge .since 
answer (b) is the only one which fills the slot to make a semantically acceptable, 
sentence. The second question 
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H) Does i\w item test a 1 knowledge u the world which is available to all uilturts or 
o) knowltd K t> of the world which is readily a\ailahlr only to particular cultural groups 
Item (14) tests knowledge of the world. Tins, not language knowledge, is 
the basis for seeing a eonnection between playing all day and feeling tired. 
However, «ince it is reasonable to assume that every cultural group would see 
such u connection, the item does not depend on a piece of special cultural 
information. It docs not test knowledge ol the world available only to particular 
cultural groups. 

14) Sam\in<l Hill reived the whole da\ In the exemiif! thev felt 

a) rested l> ) s.nalJ c) tired (SD.HT. Font, \ U-\el 1. #17) 

By this procedure items (12) and (14) aie not considered to be culturally 
biased, although they might appear to be. On the other hand, items (1-10) 
are considered culturally biased, for in each case special cultural information 
is required and the answer to question (13b) is positive. 

Havin„ established that cultural bias exists, the next question is whether 
the presence ol bias is an important matter. Certainly at least three of the tests 
from which items (1-10) are drawn are widely used: the Stanford Diagnostic 
Heading Test, the Catcs-McCinitie and the Canadian Test of Basic Skills. One 
aspect of this question is the frequency and distribution of bias. For example, 
one can count the percentage of biased items in a test. Mv investigation of the 
Canadian Test of Basic Skills, Form 1 (grades ^-8) indicates that 7¥ of the 
items are biased. This is by no means negligible in its likely effect on the scores 
of culturally different students. Moreover the biased items arc unevenly dis- 
tributed, so that ,\ .he grade 5 section of the test the percentage of biased 
iter, is 15%. Other tests show such clustering too. Thus the Gates-NfeCinitie 
Prim<> C Form 1 test contains l(#*biused items and in the Sta- ford Diagnostic 
Reading Test Form X Level ! the percentage nses to 167. T it ,e percentages 
are h:gh enough to ha\e a marked effect on score.. In additirn, these figures 
are somewhat conservative and a'most certainly underestimate the amount of 
biaj». Items which are borderline cases have not been counted, and my analysis 
of 'h e items, as a member of the host culture, is likely to overlook some of the 
less obvious cases of bias. ' 

Another a-peet of the importance of bias is the question of whetti?r the 
decisions made jbout individual students on the basis of biased reading com- 
prehension tests are tii\ial orser-oi. For example, reading comprehension 
scores ,e often used to assign students to reading groups of different levels 
of competence. .Icr" a bins v result is likely to lead to incorrect diagnosis of 
difficulties in reading and inappropriate teaching. M<rre serious is the situation 
.n some schools when leading comprehension scores are the means to p!"*e 
students in streams ol diflerent general academic ability i.e. a low store puts 
students in slow class, possibly for the rest of their school career. When a, read- 
ing comprehension score is used as an index of academic ability, a b ased score 
and the decision following Iron it can ha\e all the force of a self-fulfilling 
prophecy. 
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Even if it is accepted that bias exists in quantity and that it may affect 
important decisions, it is still possible to argue that bias is not a matter worthy 
of serious attention. One argument is that students should be tested on cultural 
knowledge since they will lip* be able to function pioperly in North American 
society if tiny do not Irani about North Aniencan culture. A reply to this is 
that it is certain!) appropriate to test for cultural knowledge bin that testing 
for cultural knowledge should not be confounded with testing for eading com- 
prehension. If one tests for cultural km -..edge one should test for it separately, 
specifically and systematically, as some Modern Language teachers do. A further 
and much stronger argument denies that cultural knowledge should be tested 
separately and asserts that special cultural knowledge should be a necessary 
part of reading comprehension tests The argument .(which has been advanced 
by competent and thoughtful ESI, teachers) is based on the claims that special 
cultural knowledge will inevitably occur in reading materials that students will 
have to deal with, and that t reading test should faithfully reflect the reading 
tasks and materiak facing the student. 

\Vlme there are a number of objections to be made here (e.g., how much 
special cultural knowledge is actually required bv a math textbook?) what is 
-^T^rticularly wrong with this view is that it concentrates on the students' per- 
formance but ignores the evaluation being made of that performance. Suppose 
two students give the wrong answer to a reading comprehension item. One is 
Canadian, the other a recent immigrant. Bn'\ have misunderstood, but the 
evaluation made of the first may not hold for the second The former may be 
weak in reading comprehension, the latter may be missing some 4 cultural knowl- 
edge. The task they both face is the same, but the interpretation of what they 
do is different. One cannot automatically conclude that the immigrant is weak 
in reading comprehension, / 

From what has been said so far, various suggestions can be made about 
the second question: how teachers and students can identify bias and what ac- 
tion they might take. First, teachers can investigate the items in the reading 
comprehension test which is currently being used in their school and in am' 
other available reading test which nvght be used in its place. A procedure for 
doing so has been outlined above in (11-14). Croup discussion of the items, 
either with other teachers or with KSL students, works v vll since the lone 
reader often passes over difficulties that a group will detect and makes cultural 
assumptions that ESL students will be more conscious of. and it is an activity 
which is justifiable in itself, since it requires careful reading and interpretation. 
If the current reading com prehei test turns out to be more biased than an 
available alternative, then a case can be made foi switching to the alternative 
♦est. Secondly, once biased items have been noied. teachers can check to see 
that the bias is taken into account in the interpretation of test scores and in the 
decisions about students that are based on such interpretation, 

Finally we turn to the third question: whether cultural bias in reading 
compiehension tests is inevitable in principle, or is avoidable in systematic ways. 
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What has been demonstrated so far is that some test items confound language 
knowledge and cultural experience. In effect these items are not simply testing 
the ability to rfrad the comprehension passage, they are testing for o.her in- 
formation as well. As such, they are poor tests of reading comprehension. The 
maker of standardized tests has certain procedures qf statistical item-analysis 
which ar*» intended to ensure that items meet standards of validity and relia- 
bility, yet these do not .seem to have been successful in eliminating culturally 
biased items in the tests examined. The reason for the presence of biased items 
appears to be that the initial scrutiny of items for suitability (or content va- 
lidity) is very crude and allows biased items in. The remedy offered here is to 
relate the analysis of items (as shown in 1-14 above ) to semantic theory in order 
to show that the analysis of items can be based on semantic principles. I be- 
lieve that this is helpful for clarifying both cultural bias and the general issue 
of the content validity of reading comprehension items 

The heart of semantic theory is the re'ation of Inference ^or consequence 
or implication) If one can show the role of inference in reading comprehension 
items, one can show the connection between the analysis of these items and 
semantic theoiy. (15) is a clear example of the inference implication conse- 
quence relation. "The girls Were putting icing on the cakes" Passage. 

15) " . Three girls put frosting on the cakes that had cooled. We learned a new word for . 
"frosting". At the Bakery it was called "icing" '* 

Q. What were the three girls doing? 
A 1 ) cooling the cake's 

2) putting icing on the cakes 

3) making frosting for the cakes 

(CBT.S Form 1 #9) 

Can the answer to 15 be inferred from the passage*^ The passage implies that 
answer. The passage does not imply answer (1) and it does not imply answer 
(3). The general form of the relations between the passage and the answer is 
given in (16a), and this general form can be simplified still further, as shown 

16) a) Passage implies right^ answer sentence 

Pas.sage doesn't imply* wrong answer sentence 
(or contradicts) 
h)"Si implies S» — true? false** 

in (16b) to the question of whether one sentence (S,) implies another sen- 
tence (SA Thus (15) at its simpl" t requires the reader to decide whether 
"three girls put frosting on the :akes that had cooled" implies 'the girls were 
putting icing on the cakes'. (A check on this is that "therefore*' oi "conse- 
quently" can be appropriately inserted between these two sentences hut not 
betwee.i the passage sentence and the other possible answers). This is the 
typical pattern tor reading comprehension test the coirect aimver sentence 
can be inferred from one passage sentence (or a combination of passage sen- 
tences), and more generally, the student is being asked to judge whether one 
sentence follows from another oi not. In this win inference plays a central role 
in reading comprehension items. 
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There arc a number of variant forms of tjp^nattcrn show.ii in (16a). Foe 
example, (14) takes a different form, in that (most of the answer ^entente is 
part of the passage. Yet when the correct word\s inserted the second sentence 
follows froln the first "Sain and Hill played the whole day" implies "In the 
evening the) felt tired." ft does not imply the possible sentence made by the 
insertions of "rested" or 'small (14) conforms to the general form ot (18b). 
However, this is not to say that every comprehension question conforms to 
(16b). Clo/e items like (10) minimally icqmre that the word inserted in the 
slot should complete the sentence to form a semantical ly acceptable sentence. 
If that is all that ;s required in an item, it is demanding recognition of rela- 
tions within a sentence rather than recognition of implication, which is a rela- 
tion between sentences. 

The concept of implication has an important be*, .g on the analysis of 
cultural bias. *It has already been noted that the cultural bias problem in read- 
ing comprehension tests arises because both language knowledge and cultural 
knowledge are required in these tt ^s. Clearly it would be helpful to distinguish 
betw^n language knowledge and cultural knowledge m a principled way. 
(15) is ai. item winch tests language knowledge and (7) is one which tests 
cultural kno vledge. Both require the recognition of an implication. The im- 
plication in* ( 15) is "based on language knowledge and the implication in (7) is 
based on cultural knowledge. The distinction between these two kinds of im- 
plication is almost identical to the distinction seiqanticists like Katz and Leech 
have made between semantic implication, which is based on knowledge of 
language and factual implication, which is based on knowledge of the m world 
(which would include cultural knowledge). 

This is a very important and useful connection. Even though it is not un- 
coil trove rsial and not without borderline problems, the distinction between 
semantic implication and factual implication is based on clearly argued prin- 
ciples^ Katz 1972, pg. R7ff,, Leech 1<)74. pg. 7ff) and the *ame arguments can 
be used to establish the distinction Ik ,veen language knowledge and cultural 
knowledge in test items. In addition, a major achievement of linguistic semantics 
is the detailed analysis of semantic implication in natural language. Th:s means 
that the analysis of semantic implication in reading comprehension test items 
can be supported by a developed Bod> of knowledge. This is not to say that 
the body o! knowledge is complete and the analysis is automatic for a great 
deal still needs to be .done. Yet it is clear that the analysn can he systematic. 

Nor is this to say that reading comprehension test items should be re c tricted 
to those that solely test semantu implication. (17) identifies a nninbrr of dif- 
ferent bases for iniplieat ; on, ^ i?b ) requires some further explanation. In (15) 
thr passage tells the reader that "frosting" and "icing' aie to be considered as 

17) Imfilwx on the basis of: 

a) knowledge of language — language tests 

))) information in the passage (g.med hv using (a)) 

— Reading Comp tests e.g. (15) 
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t) general knowledge of the world (supporting (a) & (b)) 



— Reading Comp, tests eg (14) 



d) special cultural knowledge 

e) special 'subject" knowledge 

(a) Seniantic/Lint^nshc implication 



— Cultural tests e.g. (7) 
— A^nievement tests 



u), (d) & (e) factual implication 



synonyms, and skilled readers will treat the words as such even if they art 
not synonyms in the readers' own usage. In other words, information in the 
passage is capable of overriding the speakers' personal knowledge of their lan- 
guage. This is an extreme case to illustrate the importance of passage informa- 
tion. The use of passage information is a necessary feature of reading com- 
prehension test "Items, for otherwise it would be possible to answer the items 
without reference to the passage. Since (13) implies ti.u* general information 
about the world (17c) is also acceptable, it is claimed here that 17 (a-c) are 
permissible in reading comprehension test items. It has been argued above that 
special cultural knowledge (17d) is not permissible and 17(e) indicates that 
the same holds for the type of special subject knowledge that is typically in- 
cluded in achievement test items. The distinction between semantic and factual 
implication would draw the line in a different and inappropriate place, exclud* 
ing every thing but (17a). So semantic theory is helpful because it provides 
a principled basis for making distinctions which are essential, but read ng com- 
prehension test items are not limited to semantic implication and reading com- 
prehension tests are not equivalent to semantic tests. A final point is that the 
exclusion of (17d) and (17e) is meant to hold for tests only, and not for class- 
work in reading comprehension. In a test the lack of special cultural informa- 
tion can be a stumbling block whereas in a classroom situation it can be a 
valuable point of discussion. 

To summarize, items like ( 1-10) are illegitimate in tests of reading com- 
prehension when tnken 'by ESL students because they require special cultural 
knowledge which is available only to particular cultural groups. This special 
knowledge need not be incorporated into reading tests. Readirig tests should 
essentially test the ability to use knowledge of the language and reading skills 
to get meaning from the printed page; it is the task of achievement tests (ot 
biology, social studies, etc. ) to test for non-linguistic knowledge. It is # legitimate 
to draw on non-linguistic knowledge in a reading comprehension test only 
where it plays a neutral, supporting role, and is common to all students taking 
the test. It should be standard policy to eliminate items like (1-10) from tests 
when those taking the test include culturally different groups. In fact this 
would be likely to improve the test for all groups by eliminating material which 
is irrelevant to the aims of the -test. This elimination can he made using the 
intuitive procedure outlined in (11-14) above. Linguistic semantic theory plays 
a crucial role in establishing the major distinctions required and in providing 
the detailed semantic description necessary to put the procedure on a systematic 
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Evaluating Bilingual Competence: An Experimental 

Innovative Test 

J. Donald Sowan 

University of California, Los Angeles 

Sandra Plann 

No'throp University 

Evaluation of linguistic competence, inherent in language teaching, is par- 
ticularly important in bilingual education, where a need exists to measure 
achievement in both languages. In fact testing is mandated where public money 
is provided, since funding agencies rightly feel thert should he a determna- 
tion of the value of programs supported. 

Since both languages are tested, a comparison of proficiency will be more 
meaningful if we are confident that test results acre* language ooundaries are 
indeed comparable. Attempts to design equivalent evaluations <t a>e led to ex- 
perimental applications of cloze testing, bilingual syntax measures, etc The 
effort should continue:, the need is great and even negative reports ran he 
useful. r *s* r 

Encouraged by thtf success of an experimental integrative grammar tesi 
in English (IGT), we wish to apply to Spanish the relatively simple concept 
of identifying the second word in sentences where phonological features are 
obscured at word boundaries. The test has proved useful in English, with re- 
markably high coefficients of validity and' reliability. This 100-item test, ad- 
ministered by tape recording in 16 minutes, is very efficient. 

This article reports on a project which had its genesis in the development 
of an English proficiency test. Ws experimental test attempts to measure-a - 
subject s competence in the structure, of informal spoken English, employing 
an exclusively integrative format. The test, referred to as the IGT (for Integra- 
tive Grammar Test) rests on a very simple assumption: that ability to handle 
the morphophonemic alternations in English, especially those that characterize 
-variation at word boundaries in informal oral communication, will correlate 
meaningfully with competence in using the grammatical patterns of the lan- 
guage. Native-speakers of English very well on this test, usually scoring 80 
to 100 percent, but non-native subjects, particularly those with indifferent con- 
trol of the morphophonemic features, find the te«;t difficult. The score? of non- 
native speakers ran^e from nil to native-speaker range, and there is evidence 
that the spread of scores for non-natives reflects with reasonable accurac y the 
varying levels of ability of the examinees. 

The test has been reported on at various conferences and in articles th*** 
have appeared in professional journals. We will therefore illustrate the test with 
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just one triad of examples. The task is to listen carefully to a short sentence, 
determine the full form of the second word, and write it down on a test answer 
sheet. The subject* tries to make the identifications as s/he hears the item sen- 
tences: 

e- 1. Whuttaya want with that cat? 

2. Whattaya doin with that cat? 

3. Whattaya done with that cat? 

The answers, so easy for a native speaker that s/he *is likely to miss a 
few items because attention wanders, can be surprisingly difficult for non-native 
subjects. Herein lies the value of the test. 

First, let's look at the example items. The answers are do, are, have, but 
this information is not physically present to the subject, who hears for all three: 
/watayd/; the /wat/ initial syllable is what in each case; the third syllable is 
/y»/ for ytoi. The second syllable, crucial for the test, is reduced to the ultimate 
— a simple, weak-stressed schwa, pronounced /S/. The examinee has to look 
further into the sentence Br the evidence that disambiguates this schwa, by 
sorting out the lexical-grammatical co-occurrence potential of each sentence: 
want, doing, done require the interpretations do want, are doing, have done. 

Indications are that the English version of this experimental instrument is 
reasonably successful. An administration to 686 applicants for admission to the 
American University in Cairo developed the following figures for the English 
version of the IGT: 

Validity: A coefficient of correlation of .871 between the IGT and the 

grammar subtest of the MTELP (Michigan Test of English 
Language Proficiency). 

Reliability: A coefficient of correlation of .968 from a test/retest set of 

scores, inherent in the test design. 

Efficiency; One reading only of each item with an item flow of approxi- 

mately one every 6 or 7 seconds. The entire test, including 
instructions and~ sample items, can be completed in about 18 
minutes. 

Administration: The test is presented on tape which helps guarantee com- 
parable conditions for different administrations. The taped 
delivery also controls the pace of the test, to provide the 
time pressure that helps keep validity and reliability high. 

Correction: Items are highly objective and can be accurately hand cor- 

rected ( depending on score and on the clarity of the subject's 
handwriting) at the rate of one and a half minutes to 3 or 
4 minutes per test paper. 

Our task, then, was to construct an aural Spanish test which, like the 
English test, would measure competence in the structure of Spoken Spanish. 
We began by thinking of items, often in pairs, Whose comprehension depends 
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on catching a phonetic, syntactic, or semantic clue. For instance, the clue was 
phonetic in the pair, 

Una perra es lo que quiere esa muchacha rubia 

and 

Una perra es lo que quiere esa muchacha rubia. 
The clue was syntactic in: 

Las hab:a memorizado para el examen 

and 

Las sabfa de memoria para el examen, 
and, a semantic clue allowed the differentiation between: 
Los hirvi6 por 10 minutos para tenerlos bien cocidos 

and 

Lo sirvi<S y d^spu^s empezamos a comerlo. 

To be a valid item, native speakers of Spanish have to be able to identify and 
differentiate both sentences consistently when presented with the two together, 
or either one alone. If they can't, there's no point in asking students to do so. 

tried these items out with native speakers from various Spanish-speak- 
ing countries, and eliminated items that were judged to be too obscure, or to 
contain regional vocabulary. We tried to be particularly careful in choosing 
common vocabulary items, to make the test intelligible to both a variety of 
native speakers and to students of Spanish whose primary contact with the 
language may have been in the classroom. 

After this initial screening of items, we piloted a preliminary version of 
the test on native speakers. We began refining the exam by then eliminating 
or modifying the items that everyone, or nearly .veryone, failed to identify. For 
instance, an* item ' in an early' version of the test v was: 

Las eleccipnes de ayer nos sorprendieron. 
Almost everyone wrote lecciones instead of elecciones for the second word, 
due, we assumed, to the academic setting in which the test was taken- To 
clarify this item, we added a semantic clue: 

Las elec^ones de ayer sorprendieron a los socialistas. Notice that the 
addition of thikbit of contextual information weights the clues heavily toward 
the semantic comWient, and a knowledge of Spanish vocabulary will aid con> 
sidcrably in the iden^^ation of the second word in the sentence. 

On the basis of thfete preliminaries, we put the fifty items in pairs and 
ordered them, from the el&est to the most difficult. On the second part of the 
test, the same fifty items w^Jisted/fn random ordor. We then made what we 
hoped would be a final recd^jj^of the examination. It was recorded by a 
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speaker of Castilian Spanish, with a very clear pronunciation that we felt would 
be most comprehensible to students of Spanish, and easily understood by native- 
speakers. However, both native and non-native speakers felt this was not the 
case, so we re-recorded the test, this time using a young man who had grown 
up in Venezuela, but whose parents were Cuban. The Carribean influence was 
noticeable in his pronunciation. Before making this recording, we took the 
opportunity to make a few final revisions of items, and the latest version of the 
test was ready. 

The Spanish IGT has not b' °n a success in th*» same way as the English 
test. It does not clearly separate native and non-native speakers. Native speakers 
in the classes tested range from the high <XXs to zero. It appears that a different 
kind of competence is being tapped. Whereas the typical items of the English 
test are relatively easy to do, requiring only a feeling for structural patterns of 
the language, most of the items on the Spanish test depend to some extent on 
the meanings of the sentences. In fact, in our attempts to strengthen some 
items that lacked discrimination, wc can see in almost all cases that we added 
semantic clues. As a result it seems likely that the revised test measures general 
verbal ability instead of sensitivity to the grammar. 

Some native speakers of Spanish do very well indeed, the highest score 
earned is 96. Non-native Spanish speakers also range widely, though usually not 
as far up or down as the natives. The best non-native score we have recorded 
is 90. These two scores are from a small group of language professionals who 
took the Spanish test. Their mean scores were: 

Natives- Non-natives 
Mean 82.3 G2.1 

Comparable data for a similar group in the English test is: 

Natives Non-natives 
Mean t 92.0 67.2 

Data that are more leveahng involve not postgraduate professionals with 
considerable linguistic sophistication, and experience, hut ordinary students. 
One group of students enrolled in the Santa Monica High School was espe- 
cially attractive for research purposes: They were studying in a bilingual class 
where both Spanish and English are extensively used and in addition the classes 
are roughly balanced with comparable numbers of native speakers of each 
language. AJI students, native in English or Spanish, took the Integrative Gram- 
mar Tests in both English and Spanish. To cancel any practice effect giving an 
advantage to the language whose test was administered second, half the stu- 
dents took the English test first and half the Spanish. The results of this adminis- 
tration can be seen in Table 1. 

The "N Adjusted" figure is reduced from N bv tin number of refusals, 
f hose who turned back a blank foim. The means, ranges, and scores show that 
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TABLE 1 

Performance Means of Santa Monica High School Students 
in Bilingual Social Studies Classes 



Class 



Native 
Speakers 


Language & 
Sequence of Test 


N 


N 

Adjusted 


Range 


SD 


Mean 


English 
Spanish 
English 
Spanish 


English First 
English First 
Spanish Second 
Spanish Second 


9 
15 

9 
15 


9 
7 
8 
14 


87.1 
44 
23.8 
41.9 


85-95 

0- 11 
12-37 

1- 85 


4.1 
4.3 
9.2 
31.4 


English 
Spanish 
English 
Spanish 


English Second 
English Second 
Spanish First 
Spanish First 


16 
8 
15 
10 

Summary 


16 
8 
15 
10 


83.2 
16.8 
21.7 
44.2 


72-89 
1-44 
8-44 
1-83 


#4.6 

17,4 
10,2 
27.0 



Combined Combined First Second 
English Means Spanish Means Test Test 



English 
Spanist. 



"ativt 1 Speakers 
ative Speakers 



85.1 
10.6 



22.7 
43.1 



54,4 
24.3 



53 5 
62.6 



both Qative-speaker groups do substantially better on tests in their own lan- 
guage, which is of course not surprising. But the English natives are in the-80's 
while the Spanish natives are in the 40's and 50 s. The English natives are also 
higher in their second language than are the Spanish natives. The Spanish 
natives characteristically have wider ranges and standard deviation figures, 
showing greater diversity in performance. The only case where Spanish natives 
have a low SD is where the range is from 0-11, which allows a very limited 
space within which deviation is possible. 

The Spanish natives do reasonably better on their second test, regardless 
of which language the test is in. This seems to be evidence that the Spanish 
test is measuring a general ability, in which practice is 'nfluential. The English 
natives actually have a small decline in their second test, showing a diminished 
mterest in a test that quickly becomes routine. The test is so easy (mean of 
87.1) that they tend to become inattentive, shown by a mean score that drops 
to 83.2 when the test comes second. 

The correlations between test scores ire very low and in no way meaning- 
ful. As Table 2 shows, getting a goo j score on one test does not mean a good 
score on the other. This suggests that first and second language sWHs are 
separate types of performance. 

TABLE 2 

Combined Correlation of Spanish a,nd English ICT Scores 



English Native Speakers 



English Test 
Spanish Test 



0 . i c » English Test 

Spanish Native Spcaxen SpanBh Tcs{ 



Mean 


SD 


Range 


842 


6.0 


66-90 


21.6 


10.3 


3-44 


14 0 


21.5 


0-44 


53.4 


25.2 


1-83 



106 



-.040 
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The test just reported is a replication of a very similar test given a >ear 
earlier, using a preliminary version of the Spanish IGT. We \v >u*t report this 
test in detail other than to say that in general the relationships between English 
and Spanish tests are shown to he quite stable. 

We do wish to cite a few correlation figures and the conclusions that seem 
to derive from the comparisons Our observations are as follows. 



Nahoe English Speakers- 

1. English test with course grades, 
r=.144. 

(Native language skills f "sted are 
unrelated to course achievement: 
test is focussed on applications of 
grammar, which everyone has 
^nastered. } 

2. ppanish test with course grades: 

rz=.113. 

(achievement m a second lan- 
guage is likewise unrelated to 
course grades. Test seems not to 
measure whatever is being mea- 
>HQ*d« for class evaluation.) 



Native Spanish Speakers 

1 Spanish test with course grades: 
r = .635. 

(Native language skills tested to 
some extent measure the same 
achievement as course grade test 
is apparently focussed on a more 
general verbal ability.) 

2. English test with course grades: 
r = .405. 

(a higher, but nonsignificant cor- 
relation figure. Perhaps at least 
some students are hot interested 
in the tests. There were 8 "re- 
fusals" — with zero scores, and an 
exhorbitant SD of 28.6!) 

3. Spanish test with English test: 
r = .455. 

(Closely similar.) v 



3. EngliMi test with Spanish test: 
r ^ .453. 

(a higher correlation, but still not 
very strong.) 

Another group of students took the Spanish IGT: students whose native 
language is Spanish enrolled in night classes in English language at the Evans 
Community School in downtown Los Angeles. This group provided an oppor- 
tunity to correlate test performance scores with various measures of opportunity 
to master English. The best correlation was with years in school, with a co- 
efficient of .609. This had not shown up at Santa Monica High School, because 
nearly all students/fiacl attended school the same number of years. The Evans 
data shows that ^fGT did not correlate positively with age. or with time spent 
in the U.S. These comparisons, made for two classes at Evans, showed a negative 
correlation ( .281 and - 592) for age — apparently older student* were less 
successful — and ( 46-3 and .139) for months spent in the U.S.A. It seems 
that the longer they stay in the US. (but still enroll for English instruction) 
the more likely they will have lower scores in their native language. Data for 
the performance in Spanish are as follows. 

C/<is\ X Mean Range SD 

1 9 fS.l 31-93 18.5 

2 17 71.4 7-92 21.9 
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The interesting details are the quite respectable means £m<l the lack of 
homogeneity shown by the large langes and standard deviation^ These adult 
students outscore the high school students at Santa Monica High.^ 

A more extensive administration of the Spanish IGT* suggests other kinds 
of comparisons. One is with the Spanish version of the MLA Cooperative Col- 
lege Placement Test The subjects were 153 first-term (new) student applicants 
to the Department of Spanish at UCLA seeking admission to classes the depart- 
ment offers. Actually it is a placement exercise, and the results of the examina- 
tion are advisory only. This allows the examination^ to be given in a rather 
informal atmosphere, with a minimum of the tension usually associated with 
examinations. The Spanish IGT was added as a research caboose, given between 
the two administrations of the Spanish exam (LA and MA). Actually the IGT 
took about the 20 minutes that were needed to score the LA. On the basis of 
those scores, students were invited to take the more advanced MA. The break 
points are shown in Table 3, 

^ TABLE 3 

Testing Standards for UCLA Applicants 



LA MA 



Span 1 156 or lower 

Span 2 157 to 166 

Span 3 167 to 173 

Span 4 J 177 or lower 

Span 5 1 178 to 183 

Span 25 184 or higher 



The Spanish IGT shows internal consistency by the data it produced in 
testing different language categories of examinees. The data> shown in Table 4, 
is just what we would predict from an acquaintance with the language back- 
ground of the applicants. The interesting category is Spanish surname, claim- 
ing English as their native language and no effective knowledge of Spanish. In 
fact they average 50 per cent higher than the Anglo applicants as a group. ' 

TABLE 4 

Performance on Spanish IGT by Native-Language Categories 



N 


Mean 


Range 


SD 


SM 


10 


66.8 


25-97 


18.95 


T99 


6 


62.2 


44-34 


16.99 


6,93 


15 


32.5 


2-63 


18.57 


4.90 


43 


33.4 


16-82 


14.48 


2.21 


116 


23.4 


1-82 


12.70 


1.18 



Native Spanish Speakers 
Bilingual Spanish-Er 'lish 
Spanish Surname (Nat. Eng.) 
Anglo (Scoring above 173) 
All Anglo Examinees ^ 

At the time this is written, the applicants have been assigned to classes. 
(That is, 95 of them have, the other 58, for one reason or another, did not 
appear on the official grade sheets.) They have been distributed by the LA 
and MA test scores into classes at six levels: the first two ye?rs of Spanish 
instruction given at UCLA. 
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The comparison and correlation of the scores is most interesting. Summaries 
of these scores are shown in Table 5. 



TABLE 5 
Tables of Performance Means by Class 



Spanish 
Spanish 
Spanish 

Spanish 
SpaniU^ 





M 


LA (108) 


MA (190) 


ICT (100) 


CPA (4.33) 


1 


8 


156.6 




10.9 


3.04 


2 


17 


160.5 


180.3 


18.1 


3.41 


3 


31 


169.2 


176.8 


24.1 


3.26 


4 


14 


174.4 


176.3 


23.5 


2.83 


5 


12 


178.3 


182.7 


36.6 


3.42 


25 


18 


181.6 


187.7 


52.5 


2.91 



The LA means .are naturally consistent, since they are the basis of course 
assignment, directly up to Spanish 3 and indirectly for second-year classes. The 
MA means for Spanish 4, 5, and 25 are consistent, again because they were 
used in counseling placement for those classes. The first year means for MA 
scores do not agree with course placement, which is unexpected, since LA and 
MA do not disagree with each other by much? The Spanish IGT scores are 
consistent except for scores at course levels 3 and 4. Other means are in a 
logical relationship, and have quite large gaps between levels. The GPA means 
are rot at all related to level, but there's no reason to expect them to be, since 
they are measures taken within each classroom and have no necessary relation- 
ship with other classes at the same level or classes on other levels. In fact, a 
consistent difference might be hard to explain. It looks like Spanish 4 and 25 
are perhaps more strictly evaluated than are levels 2 and 5. 

Table 6 shows the relative consistency of correlations (LA, MA, IGT, and 
CPA). 

TABLE 6 % 

Correlations Among Evaluation Measures 



MA 
IGT 
CPA 



MA 
ICT 
CPA 



Spanish 1 

LA MA 

.264 

.686 



Spanish 4 
.407 

.199 684 
-.230 .236 



ICT 



-.205 



.046 



Spanish 2 
LA MA 



ICT 



Spanish 3 
LA MA 



ICT 



.186 
.607 
.113 



-.055 
-.721 



.236 



.790 
.628 
.224 



.622 
.377 



.038 



Spanish 5 

.440 
.501 
500 



Spanish 25 



.351 
.010 



Combined Sections 



MA 
ICT 
CPA 



.640 
.676 
.091 



.623 
-.203 



.154 



.035 



.674 
.566 
.366 



.292 
.046 



.224 



The correlations at individual course levels run wild, ranging from LA-GPA 
.686 ir Spanish 1 to a —.230 in Spanish 4. Figures from the combined sections 
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show that none of the three tests conclates well with CPA. in fact the highest 
CPA correlation is a negative one ( 721 in Spanish 2). 

The combined sections show that IGT correlates best with LA (.676) 
and not badly w»th MA (.623) which is not quite as good as LA with MA 
(.640). iNone of these pi edict CPA, which is a correlation we had hoped to find. 

We set out on this project hoping that we might find a means of testing 
grainmaticaWjonipetence that could he applied across language boundaries. We 
have failed To do this, as least so far, which is a disappointment because the 
IGT idea looked hopeful. So, how do we account for the wide ran ire ::f native- 
speaker performance: from 96# to 0? 

In some instances, the Caribbean influence on oui speakers pronunciation 
may have played ;i pait. In this dialect, the final , s teiuK to be aspirated, or 
pronounced more like a bieathy h . Thus, las, for example, would sound more 
like lah. For examinees unfarnihai with this dialect, such a pronunciation could 
cause confusion on certain items, and a consequently lower score. Here's an 
example: 

La sopera esta en la mesa. 

Las opera manana nor !a unmana 
Here, the placement of the v — initial on sopera or final on las, is crucial to the 
identification of the second word the sentence. 

We think there is an even u e important consideration in explaining the 
great variation »n native-speaker scores. As I mentioned earlier, the features 
which differentiate test items are syntactic, phonetic, or semantic. The inter- 
pretation of the Spanish test items is heavily weighted toward the semantic 
component. The comprehension of only 365? of the items depends on a knowl- 
edge of Spanish grammar, interpretation of 162 of the sentences hinges on 
pronunciation differences. Howes er, the comprehension of 48# of the items is 
based on semantics. This is not the case with the English test, 86% of whose 
items are based on syntax. Interestingly, one bilingual examinee commented 
that he somehow did not leel that the two tests measured quite the same thing. 
His intuition may be a comment on th s dissimilarity between the two tests. 

A knowledge of the grammar is much more intuitive and universal among 
native-speakers than is a knowledge of the vocabulary. An instrument that 
tests • >cabnlary assesses, to some extent, the sophistication ol the examinees. 
In keeping with tins theory, we have noticed that the scores of the native- 
speaker examinees seem to correlate with their level of education. 

Revising the test so that more ol the items are based on syntax might 
remedy this problem. For instance, items like these would depend more on a 
knowledge of Spanish grammar tm then comprehension. 

II ;i sldo muy trnto v\ correo 

Has ido muy lento al correo 
We think that by makmg the Spanish test moie like the Kughsh veision in 
what it measures, the two, uM-d togethei, would piovidc a more accmate picture 
of btijngual balance. 
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An Alternative to the Cloze Test 

Karen A. Mullen 

University of Louisville 

An editing task has been suggested as an indirect means by which to assess 
second language proficiency (Bowen 1978). The' present study was conducted 
to see how well performance on this test compared to that on a writing task, 
an. oral interview, and a cloze parage. The first two represent direct, integra- 
tive tests of proficiency, both of which require excessive time to administer 
and/or correct. The third is an indirect, integrat /e test, also requiring a rela- 
tively large amount of correction time. Dat.i for this study were assembled i.i 
the form of two scores for the editing test (misidentification and non-identifi- 
cation), two scores for the cloze passage (exact-word and acceptable-word), 
and composition and interview evaluation scores. The results of the study show, 
first, the non -identification measure to be superior to the misidentification meas- 
ure or a simple addition of the two in predicting performance on the other 
tests. Second, this measure correlates more highly with the direct tests than 
the cloze-acceptable measure does. Third, the score variance is larger, more 
item-facility and item-discrimination indices fall within the acceptable range, 
and the inter-item reliability is significantly higher for this editing measure than 
for the cloze-acceptable score. In view of the amount of time required to arrive 
at an accept able- word cloze score and the less impressive performance of the 
t cloze in relation to the editing measure, the editing task is recommended as a 
more satisfactory indirect, integrative test of second-language proficiency. 

s 

The cloze test is considered a relalively good instrument for testing second 
language proficiency; it has several features to recommend it, the most inv 
portant being ease of construction (Pack, 1973) and high correlations with 
direct tests of second language proficiency (Hinofotis .1977, Mullen 1979c). It 
also has been shown that differences among subjects taking the test rather than 
differences between the two commonly accepted means of correction or differ- 
ences due to passage difficulty make the most significant contribution to explain- 
ing the score variance (Mullen 1979c). However, it has been suggested that 
correction be done by the acceptable-word method since it tends to produce 
sco.es w. n not only have a larger variance but also correlate more highly 
with criterion tests (Hinofotis 1977, Oiler 1972, Mu^en 1979c). Several proce- 
dures r t determining acceptability have been proposed, and studies have shown 
that the resulting scores do not differ significantly in the degree to which they 
correlate with criterion tests (Oiler 1972). But one feature common to all 
acceptable-word procedures is that tfiey must be rigorously defined and. applied 
and consequently much time is required for the correction. A second worrisome 
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feature of the cloze is the attitude subjects have toward it, considering it an 
inappropriate, even an unfair test, and therefore do not accent it as a bona fide 
means by which to evaluate their proficiency. Because of these practical prob- 
lems with the eloze— that of scoring time and face validity— the editing test 
(Bowen 1978) might be a suitable alternative. Like the cloze, it requires that 
the subject process the syntax and semantics of the text in order to restore it 
to its original form; in the cloze, this is done by supplying what is missing, in 
the editing by deleting what is alien. 

In view of the promising correlations to a criterion test such as the Mich-_ 
igan Battery in Bowen's study and the ease of scoring, a study was conducted 
at the University of Louisville *o see whether performance on an editing test 
compared favorably with performance on a cloze test and on two direct tests 
of second language proficiency— an oral interview and a writing task. 

L Method 

2.2 Subjects 

Of the fifty-Jour foreign students seeking admission to the University of 
Louisvi^ who served as subjects, some had finished intensive English programs 
and were applying for admission to a degree program while others had studied 
at a local community college and were liopmg to transfer to the University. A 
few were foreign students, recently graduated from high school, who were seek- 
ing admission to an undergraduate program. 

2.2 Materials and Design 

Four tests were administered to the subjects, an editing test, a cloze test, 
a writing task, and an oral interview. Some subjects had recently taken the 
TOEFL prior to the testing and their scores were included in the study. 

The editing test was based on a seventh-grade reading level passage taken 
from the Science Research Associates Reading Laboratory IV. The randomly 
inserted words were assembled by taking the first word in the second line of 
every left-handed page of an ESL text ot comparable reading difficulty. They 
were inserted one by one at random points in the text, determined by the throw 
of three dice. To keep the passage length closely similar to that of the cloze 
test and to make the average distance between insertions as close as possible 
to the distance between blanks in the cloze test, the positions available t for 
insertion ranged from after the fifth to after the fifteenth word in the base text. 
Subjects were told to cross out the fifty words which did not seem to fit in the 
text. A sample test is illustrated in Figure 1. It was corrected by first overlaying 
a stencil to reveal all words except those which were randomly inserted; any 
crossed-out words among these were counted as inisidentifieation errors. A 
second stencil disclosing the randomly inserted words was then placed over 
the passage; any word which a subject failed to cross out was counted as a 
non-identification error. This follows the proceduie described by Bowen. 

l'j- 
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Figure 1 
A Sample Editing Test 



This is a test designed to measure your proficiency in English. As 
you read the passage below, you will find that some words do not 
seem to belong in the text. Cross out the words which you think do 
not fit in the sentence or the passage. You should cross out fifty 
words. 




DIRECTIONS 



Example: Since man first appeared on earth he allowed has had 
to solve certain problems of survival, such earl y as 
hunger, thirst, and cold. 



In the long history of man's inventiveness pleased, discoverers seem, to fall 
into salary two classes. The first is the ingenious person who sets out interest- 
ing to find the solution to a problem. The second is choice the lucky one who 
appears to would stumble upon something by accident. But we should be clear 
what we mean never by accident, for the accidental aspect of many great 
discoveries and is that something unusual has happened when there is an 
observant person present who notices took what has happened and sets he to 
work to find out why. 

The cloze test was also based on a seventh -grade reading level passage. 
Every 10th word was deleted and replaced with a blank of uniform size. Fifty 
words were deleted. Subjects were told to supply the best word they could think 
of to fit the grammar and meaning of the text surrounding the blank. Responses 
matching the deletions from the original passage comprised the exact-word 
score. Responses considered as appropriate alternatives by two judges com- 
prised the acceptable -word score. In cases where the two judges could not 
agree, the response was submitted to a third judge 

In order to control for fatigue and a possible practice effect, half of the 
group was given the editing test first, the other half the cloze test first. Although 
there was no time limit, no subject took longer than an hour on each test. 
Following the first test, subjects wrote a composition from a choice of topics; 
they were told that their compositions would be evaluated for structure, organ- 
ization, quantity, and vocabulary. To arrive at an evaluative measure, a Likert- 
type scale divided into nine equal segments was provided for each of the four 
criteria mentioned above. The nine segments were labelled poor, fair, good, 
above average , and excellent or in between. Guidelines for determining designa- 
tions were available for reference (Mullen 1977a). Four readers evaluated each, 
composition on each of the four scales. The designations were converted to 
numerical values ranging from 1 for poor to 9 tor excellent. The final score was 
computed as a percentage based on the total accumulated points in relation to 
the maximum attainable. 

Forty-three of the fifty-four subjects were also interviewed. The procedure 
for conducting the interview and for evaluating the performance followed that 



ERIC 




190 



The Learner in Focu$ 



in other studies (Mullen 1977b, 1978). The scales were or comprehension, 
fluency, control over structure, and pronunciation. Guidelines were available for 
determining position along the scale. Values were assigned in the same manner 
as for the composition, and a percentage score was calculated, again based on 
the number of points accumulated in relation to the maximum attainable. Four 
persons evaluated the interview performance. 

2. Results and Discussion 

« Relevant descriptive measures are reported in Table 1. The mean'clo2e 
scores and the mean composition and interview scores are not significantly 
different from those derived in a larger study of 154 subjects using the same 
cloze test, composition task, and interview procedure. Mean TOEFL scores 
available for only a small segment of the subject group are also reported as a 
point of reference. , 

B TABLE 1 

Mean Score and Standard Deviations for Editing Test, Cloze Tests, 
TOEFL, and Composition and Interview Evaluations. 



Tests 


Mean 


S.D. 


N 


Editing — Non- identifications (# wrong) 


19.44 


13.62 


54 


Editing — Misidentifications ( # wrong ) 


17.98 


14.20 


54 


Cloze — Acceptable (# right) 


21.50 


10.45 


54 


Cloze— Exict (# right) 


14.90 


8.27 


54 


TOEFL — Listening Comprehension 


47.27 


5.56 


18 


TOEFL — Structure and writing Ability 


46.78 


7.35 


14 


TOE FL — Reading and Vocabulary 


48.21 


7.06 


14 


TOEFL — Total 


468.40 


51.66 


22 


Composition — % of possible points 


54.50 


21.98 


54 


Interview-— % of possible points 


62.41 


22.30 


43 



In considering how well performance op the editing test correlates with 
performance on the cloze test and the two direct proficiency tests, it is apparent 
that the non-identification measure produces a higher correlation than the mis- 
identification measure (Table 2). The effect of adding the latter to the former 
to compute an overall score generally results in depressed correlations. Clearly, 
the non- identification score is a better measure. It is also important to note that 

>0 TABUS 2 

Pearson Product Moment Correlations of Scores of the Editing Test with the 
Cloze Tests, the TOEFL, and the Composition and Interview Evaluations. 



Test 


Misident. 


Non-ident. 


Misident. -f 
Non-ident. 


N 


Cloze Exact 


-.25* 


-.73 # " 


-.58*" 


54 ' 


Cloze Acceptable 


-.39" 


-.85'" 


-,74" # 


54 


TOEFL — Listening 


-.32 


-.60" 


-.53* 


18 


TOEFL — Str. & Wtg. 


-.55" 


-.60" 


-.64" 


14 


TOEFL — Rd«. and Voc 


-.05 


.16 


.07 


14 


TOEFL — Total 


-.25 


-43 # 


-.43* 


22 


Composition 


-.39" 


-.82" # 


-.71 # " 


54 


Interview 


-.15 


-.74" # 


-.53"* 


43 


* t» <■ 05 ** p < 01 


*** p < .0001 










10<J 



A Cloze Test Alternative 



191 



the non-identification score correlates more highly with the acceptable-word 
method of cloze-scoring (—.84) than with the evact-word score (—.73). In 
comparing the acceptable word measure with the i ion -identification measure, 
the latter correlates more ! i^hly with the composition and the interview evalua- 
tions (Table 3). 

There are some other important features of the noii-ideiitirication measure 
when compared to the acceptable-word measure, reflected in Tables 4 and 5. 
One js that the score variance is larger, indicating that the editing test can 
spread scores further out along the scale, thereby allowing for finer discrimina- 
tion in levels of proficiency. An analysis of responses for each item in the two 
tests reveals that for the editing test, the mean item facility is within a more 
acceptable range (615f),' that no item falls below a facility index of 30$ or 
above 85$, and that fewer items fall below the recommended discrimination 

TABLE 3 

Pearson Product Moment Correlations of the Non-Identification Stores of the 
Editing Test and the Acceptable- Word Scores of the Cloze Test with the 
Composition and Inti'rvie.v Evaluations 



Test 

Composition 
Interview 



Cloze 
Acceptabv 

77~ 
.67 



Editing 
# Identified 



.82 
74 



54 
43 



TABLE 4 

Item Facihtes (IF) and Item Discriminations (ID) Based on Conocted 
Item-Total Correlations for the Cloze Acceptable Scoring and the 
Scoring of the Correctly Identified Insertions on the Editing Test. 



Item 

_# 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
II 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



Cloze 


ID 


Editing 


Item 


Cloze 




Editim? 


IF 


IF 


ID 


# 


IF 


ID 


IF 


ID 


.83 


14 


.69 


.61 


26 


.33 


.52 " 


.54 


.36 


.80 


.53 


.80 


.45 


27 


.74 


.07 


.70 


.67 


.48 


.60 


.37 


.46 


28 


.52 


.49 


.44 


.43 


.46 


.51 


70 


.53 


29 


.46 


.56 


.83 


.68 


.44 


.59 


.70 


.47 


30 


.63 


.53 


.43 


.62 


22 


.26 


.78 


.39 


31 


.37 


.54 


.56 


.64 


.33 


.38 


.44 


.61 


32 


.70 


.59 


.54 


.44 


.35 


.55 


.76 


.37 


.33 


.35 


.19 


.81 


.54 


.59 


48 


.76 


.59 


34 


.28 


.56 


.81 


.68 


.44 


.48 


.81 


.62 


35 


.01 


.27 


.54 


.60 


.11 


.12 


.52 


.70 


36 


.50 


.64 


.30 


.29 


.70 


41 


.52 


.41 


37 


.35 


A5 


.65 


.54 


.31 


.44 


.83 


.49 


38 


.15 


.37 


.78 


.65 


.83 


.42 


.72 


.67 


39 


.11 


.34 


.31 


.50 


.38 


.50 


.65 


.38 


40 


48 


.10 


.35 


.34 


.37 


.26 


.61 


.74 


41 


,72 


.48 


.67 


.68 


.69 
U 


.71 


.54 


.58 


42 


.41 


.48 


.39 


.70 


.48 


.50 


.72 


43 


.15 


.35 , 


.80 


.62 


.93 


.45 


.44 


.42 


44 


.33 


.42 


.69 


.68 


37 


.32 


72 


.51 


45 


.20 


.38 


.80 


,67 


.07 


.48 


.76 


.68 


46 


.65 


.54 


.43 


.49 


39 


.45 


.48 


.51 


47 


.44 


.53 


.57 


.67 


.61 


.53 


.54 


.61 


' 48 


.20 


.33 


.70 


.66 


.,54 


.51 


.48 


.53 


49 


.13 


.38 


.50 


.47 


.55 


.31 


.70 


.73 


50 


.30 


.60 


.52 


.62 
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TABLE 5 

Comparison of Mean Scores, Item Facilities (IF), Item Discriminations (ID) Based on Item- 
Total Correlations, Related Standard Deviations, and Inter-Item reliabilities for the Cloze 
Acceptable Measure and the Editing Test Measure Based on Number of Insertions Correctly 
Identified. x 



Measure Clo/e Acceptable Editing {# Identified ) 



Mean Score 


21.50 


30.48 


S.D of Mean Score 


10.45 


13.02 


Mean IF 


.43 


01 


S D. of IF 


.22 


15 


Number of Items 
with IF below 30 


12 00 


0 


Number of Items 
with IF above .85 


1 00 


0 


Mean ID 


.13 


50 


S D of ID 


.14 


11 


Number of Items 
with ID below 30 


8.00 


1.00 


Inter-Item Reliability 


.92 


.96 



index of .30. A third feature is that the coefficient of inter item reliability is 
higher for the editing test than for the clo/e test This difference is significant 
at the .03 level 

In summary, the results o' this study show the non-identification score to 
be a better measure than either the misidentification score oi simple addition 
of the tw<v As measured In correlations with the composition and the interview, 
this measure is superior to the acceptable-word cloze measure. Moreover, the 
data show that the measure is superior in terms of score variance, item facility, 
item discrimination, and inter-item reliability. Finally, the test is easy to con^ 
struct and scoring requires less tune 
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Scoring Procedures For Use On Cloze Tests 
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Recent years have seen a growth m interest m the use of the cloze procedure 
in EFL proficiency testing. Some research .>eenis to have shown that the cloze 
is a valid measure of such proficuncy and that, as a so-called integrative test, 
it relates closely *<> other supposedly integrative measures of comprehension and 
production. Doubts h;ive centered on the scoring procedure to he used on the 
eloze test, l>ut previous research has produced somewhat ambiguous results. 
The present tendency is to use the exact word procedure. 

This paper will present the results of an m\ estimation into the use of five 
different scoring procedures, which will show that for the purpose of EFL 
proficiency testing, the semantic ally ace p table procedure is the most appro- 
priate. Evidence will be presented which shous that this procedure drastically 
reduces the effect of confounding vauaMcs like deletion frequency and text 
difficulty, and that it resu 1 * tn better discrimination of native and non-native 
speakers of English The , plications of this finding for an analysis of what the 
cloze test measures will be discussed, and it will be suggested that a radical 
revision of the uses of flu- procedure is necessary. 

Reseaicb into the use of different scoring procedures on clo/e tests with 
native speakers has generally come to the conclusion that the exact word 
procedure (where only icplacements of the exact woid deleted are counted as 
correct) is the most valid and practical way to score cloze tests. Taylor (1953). 
Rankin (1957) and Bornuith (1965) all find high correlations between the exact 
method, and methods allowing "any synonym" or "good enough answers" al- 
though the means of the latter scores were slightly higher. Bormuth (1965) 
found that the nearer the response came to the exact word, the higher were 
the correlations with other m i asures of text comprehension. However, using a 
"grammatically correct score". Hafner (1964) found a correlation of only .61 
with the exact word score. Fillenbaum et al. (1963) suggest that a form class * 
score — giving credit for replacements from the same form class as the deletion 
— might measure sensitivity to the close grammatical environment of the dele- 
tion, whilst the exact word score might depend more on sensitivity to remote 
semantic features of the discourse. In general, however, the practice with native 
speakers of English has been to use the Exact Word scoring procedure. 

With non-native speakers there has been a greater tendency to use scoring 
procedures other than the exact word, in fact, it is impossible to compare nvny 
of the research results using the clo/e procedure because the researchers have 
used different scoring procedures. In geneial, there is a feeling among EFL and 
ESL researchers and testeis that the exact word method is too harsh on a non- 
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native speaker of'the language. As Oiler (1972b) points out, gi\en the sentence, 
The climbed up the tree, a response of horse or elephant instead of mon- 

key is of a different order, i.e. displays a different level of language competence, 
from responses like table or with. Scoring procedures that have been used with 
non-native speakers have included the communality of response score (Carroll 
et al. 1959) and the elozentrophy score (Darnell 1968). In the former case, it 
\va., found that there was a high intercorrelation with the exact word score, and 
lower correlations with validating criteria ilian the exact word score achieved, 
and so the procedure was rejected. In the latter case however, a high validating 
correlation was achieved, although no intercorrelation with the exact procedure 
was possible, the clo/entrophy was held to be a promising alternative to the 
exact word procedure. 

Anderson (1972) got a high (.99) correiati^u>f the exact word procedure 
with a synonym scoring procedure, but unfortunately the latter procedure was 
in any case weighted in favour of the e*act word. The classic studies of the 
use of different scoring procedures with non-native speakers are by Oiler (1972b); 
Oiler, Atai and Irvine (1974); and Stubbs and Tucker (1974). Oiler (1972) 
found that he got a higher validity ^coefficient for his acceptable word scoring 
procedure than foAhe exact word procedure (.S3 and .75 respectively), and 
concluded that the acceptable word procedure was the best. Unfortunately he 
did not report intercorrelitions of the procedures*. Oiler, Atai and Irvir? (1974) 
found an intereorrelationjaf .94 between the exact word and the acceptable 
word procedure, and claimed that all validating correlations with the TOEFL 
were identical for both procedures, and therefore recommended use of the 
exact word procedure. However, they got a correlation of .75 with the, accept- - 
able procedure and dictation, against .69 using the exact word procedure and 
so one might have expected them to reach the opposite conclusion, given the 
assumption that cloze is more valid when it correlates higher with supposedly 
integrative tests like dictation. Stubbs and Tucker (1974) found an intercorrela- 
tion of .97 between the exact word method and an acceptable word procedure, 
and recommended the exact word method, ignoring the fact that the acceptable 
word method attained higher validating correlations than the exact wordrln 
fact, the differences between the correlations of the two procedures with the 
criterion entrance test of the American University of Beirut was of the same 
order as that found by Oiler (1972b) with his criterion, which led him to the 
opposite conclusion, i.e., use the acceptable word method. 

The studies reported a^e open to criticism on several grounds. In some 
cases the scoring procedures used are weighted in favour of the exact word, 
thus biasing the results. In other cases, only partial data are reported. Often 
the researchers merely report correlation coefficients, and not even the means 
and standard deviations of their tests. Conclusions are based on partial evidence 
— either on the mtercon-elations of the procedure under consideration or on the 
validating correlations, but rarely on both. In some cases, as mentioned above, 
clear co^nterevidence is ignored. 
O 
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In addition, and perhaps more seriously, time is a conspicuous failure to 
account for the similantv of the correlations that are achieved. Why does the 
exact word procedure lesult 111 apparently the same validity coefficients as the 
semantical^ acceptable vvoul procedure' Do the similarity of these coefficients 
and the high intei correlation of procedmes mean that in essence both procedures 
are measuring the same thing.- 1 At first sight this would seem to be counter- 
intuitive, since the abiht) to restoie the exact word may well depend, for cer- 
tain items at least, on the ability to detect distant relationships amongst elements 
in text, whereas the ability to restore with an acceptable vvu:d is at least argu- 
ably less dependent c,n discouise sensitivity . Intuitively, at least, one vvmld 
expect different scoring pmcedmes to lesult 111 ilifkiriit measures of some aspect 
of linguistic prohciencv— a< suggested by, for example. FilJenbanin et al. (1963) 
and Bormuth's results (1965), both lepoited above, as well as Hafner's (1964) 
finding that a gianiuuticallv coned score was less related to reading achieve- 
ment and intelligence than the evict vvoid score 

As part of an investigation into the effect of several variables on clo/e test 
performance, certain procedures used to scoie cloze tests were examined. Spe- 
cifically, five seoiing procedmes vveie developed, as described below. 

1. The exact word piocedure 

2. The semantic-ally acceptable piocedure (SEMAC) 
3 The same form clas, procedure (IDFC) 

4. The acceptable form class, same grammatical function procedure 
(ACFC) 

5 The grammaticallv collect pioceduie (CROO) 

1, Tin* exutt word pnncdarc In this proeeduie, miiioi misspellings are 
not counted as incorrect 

2. The snnanticu(hi acceptable procedure (SEMAC). The problem with 
this procedure is the decision as to whether a response is acceptable or not. Is 
it, for example, acceptable to icplace the name Mr. Vauzhan with the name 
Mr Smith, although no Mr, Smith has been mentioned in the text and will not 
be mentioned later? One marker may consider stvlistic infelicities to be un- 
acceptable, whilst another may find them acceptable Rather than develop elab- 
orate instructions foi acceptability scoring (see, for example, Clarke and Burdell, 
1977), which it was felt would never be applied in practical cloze testing, it 
was decided to compaie markers using the any acceptable word procedure, to 
^ ■ if practical agreement could be reached without any analytical scheme. The 
nature of this investigation is repoited elsewheie but. essentially, it was found 
that ten native speakei markers agieed with each other on acceptability, with 
correlations langing from .93 to .OS, and that they agreed with themselves after 
one month's interval, again between .93 and .98. Seven non-native speaker 
markets achieved the same levels of agieenien! and usability, and agreed with 
the native speaker maikers at between .81 and .93. Tljus, contrary to frequent 
claims and suppositions, it ts possible to achieve high agreement on accepta- 
bility. This authors marking for acceptability, computer-scored to achieve 100$ 
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objectivity, agreed with the native speakers' combined judgments at .99, and 
at .98 with non-native speakers The procedure was such as to disallow Mr. 
Smith for Mr. Vaughan in the case mentioned above, nor was a she admitted 
for a he where no female person had been mentioned, however, allowable re- 
placements for tjellotv in the sentence A _ win wallowed high above the 
mountain included words like blazing bright hot large lazy lovely magnificent 
nice red setting and watery. 

In an attempt to develop a scoring procedure measuring grammatical sensi- 
tivity, the following procedures were evolved. 

3. The same form class procedure (IDFCy It a response is a member of 
the same form class as the deleted word it is correct; if not it is incorrect—the 
semantic fit of the response is ignored. Problems of form class classification 
arose— is as the same form class as like or than? Aie he was due to go and he 
was able to go equivalent? A more serious problem was the severity of error. 
In the context He gazed dreamily at the Baptist preacher and they fought in 
order to sit next to Monroe the replacement of Baptist by yellow (different torm 
class) is of a" different order from the replacement of next by yellow or, indeed, 
by beside, presumably the «uine form class. For these reasons, the following 
modified scoring procedure was also used. 

4. Acceptable form class, same grammatical function (ACFC): If a re- 
sponse was from a form class which was acceptable in the context of the item, 
it was scored as correct, provided that the response had the same grammatical 
function as the deleted word. Grammaticahty of concord, number, tense, etc. 
was ignored. Thus, unlike the previous procedure (IDFC), The shop window 
was broken was an acceptable replacement for The Ing window was broken 
since shop and big— different form classes— both fulfill the function of modifier. 
Examples of this procedure, where one alternative would not be regarded as 
being an acceptable replacement for the other, are as follows: 

1 The ver y old gentleman was walking down the street. 

nice 

2 The man an( ' Ins dog can be s.rn in the distance 

with 

3. He carne wl, 7 she went. 

and 

4. The fc" nm sold all their rood. 

they , : 

The problem with this procedure is that it penalises grammatically correct 
responses (as 1-3 above) and allows grammatically incorrect responses A birds 
was singing happily. In order to get at a more complete grammatical sensitivity, 
this procedure was supplemented by th" final proccduie. 

5. The grammatically correct procedure (GRCO). Any response which 
fits t* » syntax of the context is cor.ect. It must agree in number, concord, etc. 
with the environment and be from an acceptable form class, but it need not 
have the sajiie grammatical function as the deletion. As far as possible, semantic 

q elatedness and appropriacy are ignored. 
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1. The study 

These five scoring procedures weie applied hy computer to the responses 
to 12 different cloze tests produced by deleting eveiy 6th, 8th, 10th and 12th 
word from three texts of varying difficulty (easy, medium and difficult respec- 
tively). (The difficulty of the texts was determined by a panel of EFL teachers 
and several readability formulae ( Smog, Fog, Flesch and Dale-Chall. ) The 
tests were given to 360 native speakers of English and 360 non-native speakers 
studying in the UK at tertiary level, which, since each subject took only one 
cloze test, gave 30 clo/e tests per group In addition the non-native speakers 
were given the English Language Battery (ELHA) (Ingram, 1964, 1973) which 
contains seven subtests, and two dictation tests, one easy, one difficult 

2. Results 



This papei will be pnmaiily concerned with the iion-nat>veSgeakt a r results. In 
point of tact, however, virtually all lesults weie the same for n^e and non- 
native speakers. 



As can be 


seen from Table 


1. the thiee 


texts were 


tanked in 


the same 


order, legardless 


of scoimg proet 


•dure. 
TABLE 1 










M,»an scores for 


cloze tests, non 


-native speake 


rs 




Deletion rate 6 














Exact 


Semac 


Crco 1 


Idfc 


Acfc 


Easy 

Medium 

Difficult 


30.3(1) 
20 1(2) 
148(3) 


42.4( 1 ) 
30.3(2) 
24.9(3) 


45.9(1) 
39.1(2) ' 
37.8(3) 


44.3( 1 ) 
38.9{2) 
31.2(3) 


45.7( 1 ) 
39.8(2) 
33 0(3) 


I3eletion rate S 














Exact 


Semac 


Crco 


Idfc 


Acfc 


Easy 

Medium 

Difficult 


29.8(1) 
19.9(2) 
10.0(3) 


40.7(1) 
3L3(2) 
21.6(3) 


44.6(1) 
40.4(2) 
31.3(3) 


42.7( 1 ) 
37.5(2) 
25.6(3) 


44 7(1) 
41.2(2) 
27.8(3) 


Deletion rate 10 














Exact 


Semac 


Grco 


Idfc 


Acfc 


Easy 

Medium 

Difficult 


30,8 ( 1 ) 
23.4(2) 
9.4(3) 


40.3(1) 
31.6(2) 
19.0(3) 


44.2(1) 
38.8(2) 
30.7(3) 


44.6 ( 1 ) 
39.5(2) 
27 0(3) 


44 7(1) 
41 2(2) 
287(3) 


Deletion rate 12 














Exact 


Semac 


Grco 


Idfc 


Acfc 


Easy 

Medium 

Difficult 


20.7(1) 
21.0(2) 
14.6(3) 


38.7(1) 
28.5(2) 
24.7(3) 


44.4(1) 

36.6(2) 
357(3) 


42.1(1) 
37*8(2) 
3#(3) 


44.5(1) 
38.5(2) 
31.4(3) 



It is more difficult for non-nati\e speakers to respond with replacements 
fulfilling the .same grammatical function as the deletion on a medium text than 
on an easy text, and easier to supply a grammatically correct replacement on a 
medium text than on a difficult text. No scoring proceduie, however peimissive, 
seems insensitive to the differences in text difficulty 
^Although an inspection of the means in Table 1 suggests that different means 
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were obtained on tlie same test by different scoring proceduiev t-tests tor dif- 
ferences between means were calculated to see if different scoring procedures 
resulted in significantly different means on an\ given test. On virtually all the 
comparisons, this proved to be the case. The only exceptions, i e., where no 
significant differences were found, were on the easier tevts, among grammatical 
scoring procedures (ACFC vs GRCO). Thus it can be concluded that using a 
different scoring proceduie results in different mean scores on virtually all elo/e 
tests, regardless of deletion frequency. 

The order of difficulty was: Exact. SEMAC, IDFC, ACFC, GRCO, it is 
easier to provide a grammatically conect response than a response with the 
same grammatical function or from the same form class as the deletion. As the 
constraints on replacement increase, so does the difficulty of closure. 

TaWe 2 shows the correlation coefficients for the intercorrehtions of the 
scoiing procedures. 

Table 2 shows that although the scoring procedures resulted in different 
mean scores, they show a high degiee of agieement with each other on the 
ranking of subjects— of 120 coefficients, the lowest is .71, whilst the highest is 
.99 and 47 out of 120 are at least .90. The closest relationship is among gram- 
matical procedures, whilst the lowest relationship is between exact and gram- 
matical procedures. The ability to predict grammatical function or to respond 
* with a grammatically correct response would appear not to be closely related 
to the ability to identify the exact word deleted— the former, ACFC, being 
taken to be the lowest form of grammatical sensitivity whilst the latter could 
be considered the highest form of sensitivity to style, author's intention and so 
on, if these procedures really do measure different abilities. However, even 
these t^o extremes— ACFC and EXACT— relate to each other at about .80, 
suggesting that, on the whole, the procedures might he measuring much the 
same ability. 

Although the exact woul score correlates best with the semantically ac- 
ceptable procedure (SEMAC), the SEMAC does not always correlate highest 
with the exact woul procedure, (tests D08, E12, E06, E10, M08, M10) some- 
times showing a closer relationship to the grammatical procedures. These find- 
ings are somewhat surprising in view of the claim that cloze tests are integrative 
tasks, and the parallel claim that the semantically acceptable piocedure is the 
most valid way of scoring responses It lemains to be st n whether different 
procedures measure different abilities. 

As has been reported ejsewbeie (Aldeison. stihmitted) varying the deletion 
frequency on a clo/e test results in significant differences in mean scores for 
some tests, the differences are not always in the expected direction (such that 
one would evpect 6 to be harder than H to be harder than 10 to be harder than 
12). and the\ vaiy from text to text llowevei, using am. scoring procedure 
other than tin* exact woru" greatly reduced the effect of the deletion frequency 
variable, whereas the exact piocedure resulted in significant differences on 
difficult and easy texts, when the SEMAC was used, no difference was found on 
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Difficult text > 


Semac 


^»rco 


xarc 


Acre , 


D06< 


Exact 


.90 


.71 


.73 


,75 




Semac 




.71 


.76 


.77 




Crnn 






.87 


: .92 




lUiC 








Qfi 

.wo 


DOS 


Exact 


.92 




.89 


" '.83 




Semac 




.3 


.95 


.90 




Crco 






.96" 


,97 




Idfc 








t .98 


D10 


Exact 


.92 


.74 


.78 


.74 




Semac 




.85 


.89 


.86 




Grco 




♦ 


.97 


.99 




Idfc 








.98 * 


D12 


Exact 


' .95 . 


• .84 


84 


83 




Semac 




.91 


.90 


v ..88 




Grco 


* 




.90 


.89 




Idfc 


* 


* 




.99 


Medium text 










M06 


Exact 


.94 


.84 


,76 


.77 




8emac 




.92 


.81 


.81 




Grco 






.89 


.90 




Idfc x 










M08 


Exact * 


.87 


" .83 


* 'I 4 ' 


.72 




Semac 




.90 - 




.82 




Grco 






to 


.87 




Idfc 








QO 


M10 


Exact 


.89 


.84 




OA 




Semac 




95 


.88 


.89 




Grco 






» t .93 


.94 




Idfc 








.95 


M12 


Exact 




.00 


AO 


,ov 




Semac 




.92 




.90 




Grco 








.94 




Idfc 








.99 


Easy 


text 






- 




E06 


Exact 


.88 


.8* 


.73 


.72 




Semac 




.92 


.7* 


.80 




Grco 








.87 




Idfc 










Em 


Exact 


.94 


.86 


.88 






Semac 




.93 


.86 


.83 ' 




Grco 






.81 


.84 




Idfc 








.95 


E10 


Exact 


.86 

i 


.81 


.81 


.81 




Semac 


94 


.86 


.87 




Grco 


i 




.87 


.88 




Idfc 








.99 


E12 


Exact 


.85 


.79 


.88 


.84 




Semac 




.90 


.89 


.91 




Grco 






* . .90 


.88 




Idfc 








.94 
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the easy or medium text, and only two of the six contrasts on the difficult text 
were significant. 

As a further aspect of the investigation of the five procedures, item facility 
and discrimination indices and test reliabilities \vere computed and the pro- 
cedures" were compared. From Table 3 it can be seen that the SEMAC results, 
in the highest number of acceptably difficult items on both medium and difficult 
texts. 







TABLE 3 








{sluml>er 


of items with a facility index between 




d 80i- . 




(Total 


numl^ of items = 50) 








Exact 


Semac 


Cxco 


~ Idfc* 


Acfo 


DOS 


24 


■ 11 


, 29 


35, 


32 


DOS 


18 


39 


" 38 


36 


39 


D10 


15 


35 


37 


38 


35 


D12 


20 


38 /, 


33 


33 


33 


M06 


.26 


37 


28 


•27 


25 


M08 


25 


34 


17 


. U 


12 


M10 


26 


31 


22 


' 19 


16 


M12 


31 


* S3 


29 


- 21 


20 


E06 


3.J 






8 


6 


EOS 


34 


• 22 


t: 


12 


10 


E10 


22 


.19 


8 


7 


7 


E12 


32 


* 23 


9 ' 


14 


8 



9 ■* 



The grammatical see: lug procedures produce more, acceptafWe^tem^^ 
ties than the. exact prceeduie on the difficult text, but on the other two texts 
they are clearly inferior to either the SEMrtC or the EXACT. In terms" of item 
discrimination, however, the SEMAC always results in the highest number of 
^discriminating items, regardless of text, and even the gramjnatical procedures 
give more discrimination than Jjie exact .word procedure on both difficult and 
medium .texts (Ttible 4>. 

' TABLE 4 * 

Number of items with a Discrimination Index (E, 3 ) of above +.2 * * 







Exact 


Semac 


Grco 


ldfe 


'Acfe 


D0H 




18 * 


30 _ 


30 


" 28 * ' 


26 


D08 




19 


. 41 


38 


34 


37 


D10 




1.3 


35 


34 


34 




D12 




18 


35 


31 


33 




M06 




24 


33 • 


29 


26 




M08 




24 


33 


25 


19 


/18 


M10 




21 


28 


23 


U) 


^ 19 


\112 




#3 


' 31 * 


33 


29 


28 


KOfl 




19 


20 


10 


13 


9 


K08 




26 


25 


13 


15 


13 


K10 




20 


23 


18 


11 


12 


K12 




23 


2(> 


19 


12 


9 



" The SEMAC is clearly superior to other proceduies, except on very easy 
texts^ in terms of item efficiency. If the exact word procedure is being used, 
then the text shou'd be an easy one, however, if the text is of medium or rela- 
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TABLE 5 

Reliabilities of cloze tests (number of lttms 



50) 



4 



1308 



mean 
s.d. 

s.e. as % 
KR 20 

mean 
s.d. 

s.e. as % 
KR 20 



D10 mean 
s.d. 

s.e. as ' 
k'R 20 

♦ i>12 mean 
*.d. 



Exact 

14.8 
5.3 
6.5* 
.76 

10.0 
4.9 
8.9 
.81 

9.4 
3.7 
7.3 
.69 

V 

14.6, 
5.2 



Semac 

24 9 
7.2 
5.3 
.82 

216 
9.6 
8.1 
.91 

19.0 
8.1 
7.8 
.87 

24.7 
8.8 



Grco 

37.8 
7.3 
3.5 
.88 

31.3 
10.0 
5.8 
.92 

30.7 
9.7 
5.8 
.91 

35.7 
8.3 



Idfc 

31.2 
6.6 
3.8 
80 

25.6 
8.4 
6.0 
.88 

27.0 
9.0 
6.0 
.90 

30.1 
7.9 



Ac* 

33.0 
6.8 
3.8 
.83 

27.8 
8.8 
5.8 



28.7 
9.1 
5.8 
\90 

31.4 
8.0 



s.e. as *& 


O.iJ 


o.i> 


4 O 

4. is 


4 O 

4.0 


4.6 


K'R 20 


.80 


.89 


.89 


.oo 




M06 mean 


20.1 


30.3 


. ■» 

39.1 


* 38.9 


39.8 


sd. 


*58 


82 


6.9 


5.9 


5.8 


s e as *k 


5.3 


5.0 


3.2 


2.8 


2.7 








.86 


.81 


.81 


M08 mean 


199' 


313 


40.4 


37.5 


41.2 


s.d. j 


~ 5 3 


7.9 


6.5 


4.5 


5.1 


se as % » 


4 9 * 


~~4 6- 


29 


2.2 


2.3 


KR 20* 


.76 


Z 87 


^.85 ' 


.74 


.81 


Mi() mean 


23.4 


31.6 


38.8 


39.5 


41.2 


sd. 


4.8 


6.4 


5.8 


4.8 


4.6 


v.e. as % 


38 


3.7 


2.7 


2.2 


2.0 


Kft 20 


.69 


. .80 


.81 


.74 


75 


Ml 2 Can 


21.0 


28.5 


36.6 


37.8 


38.5 


s.d 


5.8 


o.O 


7.9 


7.3 


7.4 


s.e as % 


5.1 


5.1 


4.0 


3.5 


3.5 


KR 20 


.78 


.88 


.89 


.88 


.89 


E06 mean* 


30.3 


42.4 


45.9 


44.3 


45.7 


s.d. 


4.3 


5.8 


4.3 


4.1 


3.9 


s*e. as < 


2.6 


25 


1.7 


1.7 


1.6 


KR '20. 


Z 53 


.84 


.82 


77 


.79 


E08 mean 


298 


40.7 


44.6 


42 7 


44.7 


s.d 


63 


6.3 


4.1 


3.7 


3.8 


s.e. as 2 


3$ 


2.8 


1.7 


1.6 




KIV 20 


79 


.84 


75 


.67 


,71 


E 10 mean 


30.8 


4(U 


44.2 


44.6 . 


44.7 


sal. 


5.6 


7.0 


5.0 


4.1 


4.1 


s.e. as 'a. 


3.3 


3.1 


2.1 


1.7 , 


1.7 


Krt 20 


79 


.89 


.84 


.77 


.77 


hi 2 mean 


* 26.7 


38.7 


44.4 


42.1 


44.5 


s.d. 


65 


7.9 


5.0 


4.4 


3.8 


se. as * 


4.4 


3.7 


2.1 


1.9 


1.5 


KR 20 <* 


.80 


.90 


.85 


.75 


■ .75 



tively high difficulty — usually the case 
comprehension — tWn the semanticalK' 
cedure to use. v 

ERLC 



in tests of EFI; proficiency and reading 
Acceptable procedure is the best pro- 
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Table 5 shows,, means, ;andard deviations, standard errors of the means 
as percentages of the mean, and Kuder Richardson 20 reliabilities for all scoring 
procedures. 

The dispersion achieved by the grammatical procedures is not as great as 
that achieved by the SEMAC or the Exact. The exact word procedure gives a 
distribution which is either greater than or equal to that of the SEMAC on all 
three texts, in relative terms (as. a proportion of the mean) although in absolute 
tenns the standard deviations of the exact word procedure remain remarkably 
constant at around 5.0, regardless of the variation in the mean over texts. In 
absolute terms the SEMAC almost always results in the highest standard devia- 
tion. However, the exact word procedure also results in a larger standard error 
than the SEMAC. Above all, the seinantically acceptable procedure is always 
more reliable ( as ^measured by KR 20) than the exact word procedure and at 
least as reliable as the grammatical procedures. In fact, the SEMAC's reliability 
coefficient is much more consistent than the exact word over deletion frequen- 
cies, ranging from .80 to .91 compared with the exact word procedures range 
of .53 to .81. Here again, the SEMAC reduces the variation in cloze test per- 
formance caused by changesJn the text and deletion frequency variables. 

Correlation coefficients ^ere calculated for the cloze tests scored by the 
five different procedures and the external measures of EFL proficiency — the 
easy md difficult dictation tests and the ELBA. The latter consists of seven 
Subtests: Sound Recognition (1), Intonation (2), Sentence Stress (3), General 
Listening Comprehension (4), Grammar (5), Vocabulary (6) and Reading 
Comprehension (7), 

Table 8 shows the correlations of each scoring procedure with the overall 
scores on ELBA, taken to be the best composite measure of EFL proficiency. 

Consistently the SEMAC produces amongst the highest correlations — it al- 
most always correlates higher than the exact word procedure, and with two 

TABLE 6 

Correlation of Scoring Procedures with ELBA Total 






Exact 


Semac 


Grco 


Idfc 


Acfc 


Easy text 












EOC 


.59 


. .74 


.60 


.44 


.45 


E08 


.70 


.69 


.61 


.50 


.46 


E10- 


.65 


.74 


.75 


.63 


.65 


E12 


.67 


.77 


.72 


.73 


.71 


Medium text 












M06 


86 


.88 


.81 


.67 


.68 


M08 


.68 


.77 


74 


51 


.50 


M10 


.57 


74 


.75 


.70 


.65 


M12 


.73 


78 


.75 


.70 


.69 


DifficrU text 












D06 


.51 


.67 


NS 


AZ 


.43 


DOS 


.82 


.87 


.73 


.80 


.74 


DIM 


.79 


.83 


.79 


.83 


.82 


Dh 


.77 


.85 


.68 


.72 


.70 
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exceptions it also correlates higher than the GRCO. In fact, the CRCO results 
in better correlations than the Exact procedure on deletion rates 10 and 12 on 
Easy and Medium texts. The SEMaC also gives the most consistently high 
correlation with EFL proficiency, ranging from .67 to .88 (Exact word pro- 
cedure ranges from .51-.86), thereby reducing the effect of deletion frequency 
and text changes. Thus one way to stabilise coefficients is to use the semantically 
acceptable procedure rather than the exact word procedure. The grammatical 
scoring procedures, in particular the form class procedures ACFC and IDFC, 
do not relate very closely to the ELBA criterion. Table 7 below allows one to 
compare each scoring procedure with the ELBA subtests and dictation (differ- 
ences between deletion frequencies are ignored). 

TABLE 7 

Correlations of Cloze Scores with ELBA and dictation by Text 

Difficult text 











ELBA 








Dictation 




1 


2 


3 




5 


6 


7 


E 


D 


Exact 


36 


49 


.25 




.71 


57 


.54 


.50 


.54 


Semac 


.50 


57 


30 


67 , 


.79 


73 


.63 


.58 


.59 


Grco 


37 


.49 


.35 


.50 


.72 


61 


.50 


52 


.51 


Idfc 


41 


.53 


34 


.57 


.74 


.64 


'.50 


52 


.52 


Acfc 


39 


50 


33 


.56 


.71 


.64 


-50 


.51 


.51 


Medium text 




























ELBA 








Dictation 




1 


2 


3 


4 


5 


6 


7 


E 


D 


Exact 


.49 


30 


36 


.56 


.58 


55 


.51 


.58 


.58 


Semac 


57 


.47 


.43 


.70 


.35 


.63 


.59 


.69 


.69 


Grco 


.59 


.36 ' 


.37 


.66 


.62 


.61 


55 


.62 


.66 


Idfc 


42 


.33 


44 


52 


.54 


.49 


49 


51 


.56 


Acfc 


.43 


29 


38 


52 


.53 


48 


50 


51 


,56 


Easy text 




y 
























ELBA 








Dictation 




1 


2 


3 


4 


5 


6 


7 


E 


D 


Exact 


.47 


47 


29 


.63 


58 


.55 


.47 


70 


.58 


Semac 


.55 


53 


.45 


.70 


69 


.61 


.58 


76 


.67 


Grco 


51 


44 


42 


.65 


66 


59 


.57 


.74 


.66 


Idfc 


,43 


43 


35 


57 


.54 


.46 


.47 


.64 


.58 


Acfc 


.43 


39 


.35 


.58 


56 


.48 


.49 


.69 


.62 



It might be supposed that grammatical scoring procedures would relate 
more to grammar subtests than to reading comprehension tests and the reverse 
for the exact word procedure. However, this did not happen. All scoring pro- 
cedures correlated most highly with Grammar (test 5), on the difficult text, and 
either Grammar or Listening Comprehension (test 4) on the other two lexts. 
The lowest correlations in all cases were with the segmental listening tests, but 
reading comprehension (7) also correlated uniformly low for all procedures. In 
virtually every case, the SEMAC correlated most closely with each subtest of 
the ELBA, regardles. of text difficulty. Only rarely did the exact word procedure 
correlate better with the subtests than the GRCO procedure. When the dicta- 
tion tests are also considered, the SEMAC again correlates higher with both 
tests than any other procedure. There is, however, no evidence that any one 
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scoring procedure relates consistently more to dictation than to ELBA or one 
of its subtests. Moreover, as reported separately (Alderson, 1978) the results of 
several factor analyses did not contribute evidence to the thesis that different 
scoring procedures measure different language skills, since the factors that 
emerged were more 1 or less equivalent across scoring procedures. 

Finally the non-native speakers' performance was compared with that of 
the native speakpi^ In EFL proficiency testing, it is usually assumed that native 
speakers wjiKlo uniformly well on a test, whilst non-native speakers will be 
discriminated according to their proficiency and, in fact, t-tests on mean scores 
showed native speakers to be always significantly better at the close task, re r 
gardless of scoring procedure. However, the differences between means were 
increased by using the SEMAC procedure, as the following table shows. 







TABLE 8 










Exact 


Semac 


Crew 


Idfc 


A etc 


Difficult text 












D06 


4.8 


83 


6 1 


3.8 


44 


1)08 


5.9 


129 


13.1 


109 


11.1 


D10 


5.J 


12.3 


10 5 


9.9 


10 1 


D12 


5 7 


10 2 


76 


8 I 


82 


Medium text 










4.7 


M06 - 


5 4 


82 


6.5 


4.4 


M08 


5 0 


8.2 


62 


3 8 


3.3 


M10 


64 


10. 1 


77 


54 


5 1 


\H2 


8.0 


10 6 


7 5 


5.3 


5.5 


Easy text 












E06 


4 0 


3.6 


2.5 


1.3 


"I 0 


EOH 


5.1 


4 6 


3.8 


2.7 


2.4 


E10 


1 8 


3 2 


.3.0 


1.9 


19 


E12 


3 4 


47 


3.1 


2.5 


2.0 


The SEMAC on 


both 


difficult and medium texts 


results in 


much greater 


erences than any 


of the 


other procedures. 


It also resulted in 


somewhat less 



overlap in performance between the two groups than the exact word procedure. 
In other word*, if what is required of a cloze test of EFL proficiency is that it 
maximise the discrimination between native and non-native speakers, then the 
most appropriate scoring procedure is the semantically acceptable procedure. 

3. Summary and discussion 

To summarise the results of this study it was found that different scoring 
procedures nearly always result in significantly different mean scores. Previous 
findings were confirmed that the exact word and the semantically acceptable 
procedures showed a high intccorrelation. Nevertheless, all scoring procedures 
showed a high level of intercorreiation, with the semantically acceptable pro- 
cedure relating as much to the grammatic .1 procedures as to the exact word 
procedure. The semantically acceptable procedure reduced the effect of changes 
in deletion frequency on mean test scores. It also gave the best item discrimina- 
tion and item facility indices, and produced the highest and most consistent 
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reliability. TheV^emantically acceptable procedure also gave the highest and 
most consistent correlation coefficients with the English Language Battery, both 
with the total score, and with the subtests. Thus, the semantically acceptable 
procedure would appear to produce the best measure of EFL proficiency. There 
was, however no evidence from the correlations and the factor analyses that 
different scoring procedures consistently measure different abilities. They seem 
to measure the same abilities, with greater or less efficiency. Moreover, the 
semantically acceptable procedure maximised the distinction between native 
and non-native speakers of English. 

-The results show that the existence of a high intercorrelation between 
scoring procedures is not a sufficient criterion to judge one cloze scoring pro- 
cedure against another, that other evidence needs to be taken into account, and 
that, indeed, the other evidence may be more important and lead to different 
^ conclusions. The sum of the evidence as presented here is that the semantically 
acceptable scoring procedure is the most suitable for use With cloze tests. Apart 
from Unpractical consequences in the use of cloze tests, what are the implica- 
tions of this finding? 

It is possible to argue that the exact \v\)rd procedure is the best measure 
of sensitivity to discourse constraints, and that therefore the se.nantically ac- 
ceptable procedure is less sensitive to such, constraints. However, no evidence 
was found that different procedures measure different abilities or sensitivities, 
and both the exact word procedure and the gramatically correct procedure 
related to the semantically acceptable at about the same level. We must re- 
member, however, that we are dealing with the pseudo-random cloze procedure, 
in which many 'ems deleted— function words, for instance— are usually con- 
strained by the immediate environment, and are frequently only replaceable 
by the original word* deleted. In a cloze test as a whole, therefore, the effect of 
the difference between, the exact and the semantically acceptable procedures 
will be considerably reduced by these two features. The difference in context 
sensitivity, if any, will have no effect for immediately constrained items and 
there will in many cases only be on." acceptable restoration. This suggests that 
cloze tests are relatively insensitive to discourse constraints, and that they are 
measures of somewhat lower-order skills. This suggestion is to some extent 
confirmed by the pattern of correlations with the English Language Battery, 
where the cloze related more closely to Grammar and Vocabulary than to 
fading Comprehension or even the dictation. The pseudo-random cloze test 
appears to be a useful measure of core proficiency in English as a Foreign Lan- 
guage, although the variation observed in the performance of different tests 
means that one needs to validate each cloze test—the cloze procedure does not 
produce an automatically valid measure of EFL proficiency. This variation in 
the validity of the test can, however, be reduced by using the semantically 
acceptable scoring procedure. 
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Testing in an EFL/ESP Context 

George E. Scholz and C«l*sto M. Scholz 

liutitut National d'Etectricite et dEtetronique 
Boumerdos, Algeria 



There has been much discussion concerning the level of language proficiency 
at which ESP (English for Special Purposes) can most efficiently and effec- 
tively be taught to non-native speakers. 

In an effort to find an vers to this question, instructors at the National 
Institute of Electricity and Electronics in Boumerdes, Algeria, administered 
eight tests to 50 students who had completed a sixteen week intensive EFL 
program. Four of the tests were of EFL contents 1) the grammar sections of 
the Michigan Proficiency Exam (A and B), 2) a 100-item multiple^hoioe 
listening comprehension exam, 3) 3 short EFL cloze passages and 4) 2 EFL 
dictations. Four tests were of ESP content, with passages submitted by the 
technical faculty and produced in a testing format by the EFL staff: 1) a 
fifty-item mulUplo-choice technical grammar test, 2) 8 reading passages 3) 
3 short ESP cloze passages and 4) 2 El ? dictations, All results were correlatedj 

and factor-analyzed. A . ..... 

It was found that tests of EFL content correlated significantly with tests 
of ESP content, indicating similarity of students' scores, regardless of content. 
Furthermore, the (actcr analysis revealed all te|ts o be similar m that they 
assessed general language ability. f t 

The following semester CELT Structure and Listening Tests were ad- 
ministered and the learners' final technical subject scores were collected. The 
scores were related to the students* previous EFL/ESP tests by regression 
analysis to determine the predictability of a learner's future technical per- 
formance. It was found that EFL tests predicted a student's EFL performance 
slightly better than the ESP tests, while the integrative tests of cloze and dicta- 
tion appeared to be better indicators of a learners ability to succeed in tech- 
nical subjects. 

English for Specific Purposes (ESP) enjoys a great prestige in ESL/FL. 
ESP rests on "a reputation for relative high success rates . . . compared with 
conventional teaching of English as a foreign language." (Strevens 1977c:3) In 
terms of establishing a coherent ESP curriculum with a learner-centered ap- 
proach, impressive work has been done in identifying the learners linguistic/ 
communicative needs (see, for example, Jones and Roe 1975; Mackay 1978; aud 
Munby 1977). Ewer (1975) provides an excellent discussion of the need for 
trained EST (English for Science and Technology) instructors. 

When evaluation is taken into account, however, very little has been done 
in assessing an ESP program. 
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It remains a major shortcoming of ESP that very little work has been done to devise 
fresh methods of testing, examining and assessment that match the new courses of 
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training. ESP teachers combine in rejecting as unsuitable all of the many existing 
tests in ESL. (British Council 1976) (Strevens 19771:129) 

The paucity of adequate evaluation of both learner and curriculum presents a 
serious drawback to ESP: If there is no evaluation of the teaching/Jearnin* to 
provide feedback and subsequent adjustment of the curriculum, then the cur- 
riculum is significantly handicapped, a fact which may be a source of frustration 
to both the instructor and the learner. 

The assumptions of the teaching/learning of ESP/EST are two: 1) The 
teaching/learning of EST/ESP involves both science and language. "Assumed 
shared knowledge, . . . presupposition. ... affects surface syntax in EST texts 
so drastically that language and subject matter cannot be discussed separately 
when the focus is on discourse." (Selinker and Trimble 1974:83) 2) ESP/EST 
follows a period of intensive general English in which the learner must gain a 
considerable command of English. This can be attested to by a 500+ score on 
the TOEFL or a similar proof of proficiency of the learner before studying 
ESP/EST In addition, many ESP/EST programs are designed for the post- 
secondary (tertiary) level of education, a time when the learners have com- 
pleted many years of English. ^ 

Given these, assumptions, what can an EFL/SL teacher, transformed into 
an ESP/EST instructor, do when his learners do not meet the proficiency criteria 
of assumption 2? What should the instructor do when the learners do not have 
a grasp of the science/technology being taught in the ESP class, let alone the 
language? Finally, questions within the scope of this paper are: 

1 ) What evaluative instruments should be used? 

2) What subject matter should the tests contain (general EFL or ESP)? 

3) What do those instruments measure — science or language? 

4) Which instrument provides a reasonable indicator of a learner's future 
performance in science and technology? 

5^ May ESP be introduced at an earlier level of a learner's English pro- 
ficiency than has previously been acknowledged? 

The lack of evaluative instruments in EST/ESP may be due to the con- 
troversy concerning the relationship of scientific English to general English and 
tc the teaching of subject-matter as well as English in ESP/ EST. The ESP/EST 
student is involved with "learning language and understanding science at one 
and the same time." (Boyd and Boyd 1978:25) This article reflects the effort to 
gain insight into the EFL/EST controversy as well as to provide satisfactory 
English evaluatory instruments in an ESP context. "The purpose of testing is 
always to render information to aid in making intelligent decisions about pos- 
sible courses of action." (Carroll 1972:314) To achieve these goals, the testing 
in this context has been the instrument of research. 

The EFL/ESP controversy listed above is further compounded when cur- 
rent testing re?ca r 'h is involved. Basically, the controversy in testing research 
centers around the conceptions of language and language learning. The criticism 
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directed against EFL teaching learning is that it does* not expand beyond the 
sentence level, i.e , it does not deal with the relationship of a particular sen- 
tence to a particular piece ot discourse. This focus of instruction is the heritage 
of structural linguistics, based upon the premise that language learning and 
ability, may be divided into separate (discrete) skills or components. "Discrete 
point analvsis necessarily breaks the elements of language apart and tries to 
teach them separately . . . with little or no attention lo the wav those elements 
interact in a larger context of communication.' (Oiler 1979b) Tests based on 
this premise may be railed discrete-point tests, which usually employ a multiple- 
choice format. "The most serious disadvantage of discrete-point tests in general 
is that thev fail (in most cases) to reflect actual language usage." (Oiler 1973a: 
185) 

The other overlapping but distinct view of language and language learning 
holds that "to teach a language is L, teach A student to eommunidje in real- 
life situations." (Oiler 1973a. 185) Placing a stress on communicative rather than 
discrete skill competence is an accurate contemporary view of the English lan- 
guage teaching field. Tests of language use in meaningful contexts are "integra- 
tive" (Carroll 1972) or "pragmatic" tests (Oiler 1979b). 

Cloze procedure and dictation are, two examples of integrative tests. As 
natural language is redundant, integrative/pragmatic tests exploit redundancy 
in a meaningful context. A cloze test reduces redundancy by a mechanical de- 
letion of every nth word, while a dictation provides reduced redundancy via 
distortion. Both tests challenge the learner's internalized grammar or underlying 
competence of a given language. Although cloze and dictation have generally 
been approved as tests of reading and listening respectively, both tests have 
l>een advocated as measuies of a learners overall language proficiency (see 
Aitken 1977. Oiler 1972a ). 

1. Working Hypotheses I 

As this study was exploratory in nature, a scries of working hypothtses were 
formed on the assumption that those hypotheses could lead to progress in ESP 
evaluation. 

The working hypotheses (WH) were as follows. 

WH-l: There is a significant difference between various tests of English 
ability employing EFL and ESP content. 

WH-2 Tests of English language ability employing ESP content measure 
a learner's science ability, not language ability. 

WH-3: lests of English language ability employing EFL content serve as 
better indicators of a learner's future EFL performance than tests having an 
ESP content. 

WH-4: Tests of English language ability using ESP content serve as better 
indicators of a Kirncr's future technical performance than English tests of EFL 
content. 
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WH-5: Integrative tests of English language using ESP serve as better 
indicators of a leu uer's future technical performance than discrete-point tests. 

2. Setting 

The tests developed for this study were administered to students at the 
Institut National d'Electricite et d'Electronique (INELEC) in Boumerdes, Al- 
geria, in February, 1977. INELEC is an Algerian institution unique in its at- 
tempt to carry on an entire program in English, which is considered ,a foreign 
language, rather than French, the primary language of higher instruction. The 
students selected as subjects for this study were those who had completed their 
first semester at INELEC. The first semester consisted of a 16-week intensive 
EFL session, totaling 480 hours. Included in the intensive semester were courses 
on the English of Mathematics (80 hours) and the English of Tools (80 hours). 
In addition, after eight weeks the students were given an introduction to elec- 
tricity (40 hours) and technical drawing (40 hours), courses which were taught 
by technical instructors in English. 

This setting proved interesting for five reasons: 1) Almost all subjects had 
either Arabic or Arabic/ Kabylie Berber as a first language, with French as a 
second language. The only exception was a student of Algerian parentage raised 
in France, who spoke French as a first language and Arabic as a second lan- 
guage. 2) All students had a relatively similar education in primary and sec- 
ondary institutions. 3) All subjects had some prior English language experience, 
-\ usually taught by non-native speakers with a traditional grammar orientation. 
Subjects could talk about grammar, but not use English for communicative 
purposes. Entrance tests classi'xd students at the false beginner or low inter- 
mediate level. 4) Students entering INELEC were selected in terms of scientific 
and mathematical ability but not English ability. 5) None of the subjects had 
had previous training in electrical technology ot engineering. 

3. Subjects 

Fifty students who had completed their first and second semester in Elec- 
? trical Technology were selected for this study. Although 82 students had taken 
the English tests at the end of the first semester, subjects were dropped who 
were not passed to the second semester, who changed fields of study or did not 
complete the battery of tests. 

4. Tests Instruments 
(see Figure 1) 

All the* English tests used may be classified into the categories of grammar, 
listening and reading. With the exception of the Michigan Test of Language 
Proficiency (MTELP)-Grammar section (A and B-Revised) and the teacher 
constructed listening comprehension test, all of the tests may be found in 
Appendix A. The following is a description of the eight tests which were ad- 
ministered. 
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Figure 1 
Tests 





Context 


Items 


Type 


Context 


items 


Tun* 


SUll/Focus 
Tested: 


EFL 






ESP/EST 






Crammer: 


Mich A+B 


80 


discrete- 
point 


Tech 

Grammar* 


50 


discrete- 
poir.t 


Listening: 


Listening 


100 


discrete- 
point 










Dictation* 


301 


integrative 


Dictation* 


242 


integrative 


Reeding: 


Cloze* 


06 


integrative 


Cloze* 


79 


integrative 



Reading 

Comprehension* 32 discrete- 
point 

* These tnU may b% found in Appendix A 

41 EFL ) 

1) MTELP-Grammar (80 items) The grammar section of the MTELP, Form 
A (1961) and Ferm B-Revised (1965) were used. 

2) Listening Comprehension (100 items) - The listening comprehension was 
teacher constructed. Using a multiple-choice format utterances required either 
an appropriate response or paraphrase. This type of listening test was similar 
to most standardized tests of listening comprehension. 

3) Cloze Procedure (66 items) A set of three cloze passages with seventh 
wdrd deletions were used. The passages were selected from grade school and 
junior high readings for native English speakers. Scoring was based on the 
exact and acceptable word method. 

4) Dictation (310 items) Two dictations, taken from grade school— junior high 
reading, were administered. Scoring was based on the exact word method, one 
point for each correct word, inclusive of minor spelling errors. 

4.2 ESP/EST. In developing the EST tests, the technical faculty was asked to 
submit two passages from their own coursework which they felt the subjects 
would be able to understand in terms of language and not specific scientific/ 
technical concepts. From those passages received, items were selected to con- 
struct a discrete-point grammar test, two cloze passages and two dictations. 

1) Technical Grammar (50 items) This test was constructed by the English 
teachers who compiled a list of structures taught in the intensive semester 
and then matching a technical sentence carrying the item. The structure to be 
tested was then omitted and placed in a multiple-choice format with distrac- 
tors created by the English instrudbrs. Due to a limited number of technical 
passages received by the English staff, a few items in the grammar test were 
solely the creation of the English faculty. Those items, however, contained tech- 
nical classroom content. 

2) ESP/EJT Reading Comprehension (32 items) Eight short reading passages, 
submitted by an English of Tools instructor, were administered. Each passage 
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was followed by four comprehension questions in a multiple-choice format. 

3) ESP/EST Cloze Procedure (79 items) Two passages were selected from 
those submitted by the technical faculty and were administered with a seventh 
word deletion. Another passage, submitted by an English of Tools instructor, 
was included. Scoring was by the exact and acceptable word method. 

4) ESP/EST Dictation (242 items) Two passages were selected from those 
submitted by the technical faculty. Scoring was based on one point for each 
correct word; inclusive of minor spelling errors. 

5. Procedure 

All subjects were given various tests at different times during the examina- 
tion week of the first semester. The logistics of providing one test at a time 
to all subjects proved impossible. All tests and answers were collected at the 
end of each test period. AH subjects had practice in cloze procedure and dicta* 
tion prior to the examination. 

At the end of the subjects' second semester, final scores for the subjects' 
technical courses were collected. The technical courses were Mathematics, Tech- 
nical Drawing and £-C Circuitry. The subjects were also given the Compre- 
hensive English Laflfuage Test (CELT): Listening (Form L-A, 1970) and 
Structure. 

6. Statistical Procedure 

All results of the first semester were correlated and factor analyzed, using 
Pearson product moment correlation (r) and principal component analysis. The 
use of r and factor analysis permitted testing of WH-1 and WH-2. 

A correlation indicates the associative relationship (if any) between two 
variables. The correlation squared (r 2 ) indicates the amount of variance shared 
by two variables, "Factor analysis is uiie of the statistical techniques for examin- 
ing . . . patterns of correlation." (Oiler and Hinofotis 1976:1) Highly correlated 
variable* form (or load on) a factor. It may be hypothesized that the variables 
on a factor share a common or underlying source. What is of importance is the 
amount of loading of each variable on a factor, which indicates the factor's 
importance to a variable. Using a principal components solution, if a general or 
common factor, G, indicating similarit) between variables, is to be accepted, 
the product between two variable loadings (predicted r) should equal the 
actual correlation between them. The remaining (residual) variance between 
the tw») variables should be near or at zero. 

To test WH-3, WH-4 and WH-5 the scores of the eight tests administered 
at the end of the first semester were conelated with the technical and English 
test scores of the second semester. With each of the scores of the second semes- 
ter serving as dependent variables and each of the English tests of the first 
semester serving as independent variables, a simple linear regression was cal- 
culated to determine which one of the eight English tests would be the best 
predictor for each of the dependent variables. 
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Given the correlation between two variables, simple regression analysis 
provides the be?t prediction possible. (Kerlinger 1973:604) From a one-way 
analysis of variance, the F-ratio may be found indicating the statistical signif- 
icance of the regression, of predicting Y (the dependent variable) from X (the 
independent variable). 

7. Re Ms and Discussion 

Tht mean, standard deviation, standaid error of measurement and reliabil- 
ity estimates of all tests may be found in Table 1. The correlation matrix for 



TABLE 1 



}. Semester I 









Standard 


Standard Error 




Test 


Items 


Mean 


Deviation 


of Measurement 


Reliability (r„) # 


Mich Grammar 


80 


41.84 


7\37~~~ 


4.42 


.64 


Listening 


100 


64.88 


925 


4.71 


.74 


Cloze EFL 


66 


39.04 


8.88 


4.87 


.81 


Dictation EFL 


301 


130.22 


31.56 


8.41 


.93 


Total EFL 


547 


272.40 


51.37 


11.50 


.95 


Tech Grammar 


50 


29.38 


6.71 


3.38 


.75 


Reading Comp EST 


32 


17.96 


4.83 


2.72 


.68 


Cloze EJST. 
Dictatkm EST 


79 
242 


48.70 
136.48 


10.12 
27.30 


4.20 
7 53 


.83 
92 


Total EST 


403 


23218 


42.14 


9 71 


.95 


11. Semester 11 












English 










.67 


CEIT Structure 


75 


41.84 


7.37 


4.23 


CELT Listening 


50 


26 44 


5.36 


3.47 


.58 


CELT Total 


125 


a3.48 > 


1102 


5 SI 


.75 



Technical 
Math * 
DC Circuits 
Technical Drawing 
Total Technolo©' 



100 
100 
100 
300 



53 98 
58.74 
86.18 
198 96 



18 08 
9.57 
16.61 
37.54 



4.78 
4.88 
2.88 
7.51 



* Kudar Richardson relianihtj approximation taken from lntin<turtu>n to R<#rairh i 
L Jacobs A Raiavjeh \r\v \ork Holt. Hmnehart & Winston. p 2G7 208 



.93 
.74 
.97 

.96 
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TABLE 2a 

Correlation Matrix of Four EFL and Four ESP/EST tests, 
Observed r above diagonal and Common Variance ( r 2 ) below diagonal 



1 Mich Grammar 

2 Listening 

3 Cloze EFL * 

4 Dictation EFL 

5 Total EFL 

6 Tech Grammar 

7 Reading Clomp 

EST 
- 8 Clow- EST 
9 Dictation EST 
10 Total EST 

W'hh than 00) ttni^HM itifliiali-fl 



1 


2 


3 


4 


5 


6 


7 


8 


9 


10 




~~62 


60 


62 


.79 


79 


53 


.63 


.62 


.73 


38 




53 


44 


64 


57 


63 


51 


64 


,70 


36 


28 




.43 


57 


66 


50 


.69 


.51 


.65 








( 002) 
















.19 


18 


91 


57 


30 


.68 


.77 


.77 














( 033) 








62 


,41 


32 


83 




76 


50 


,78 


.81 


.88 


62 


32 


.41 


32 


.58 




58 


72 


61 


.78 


28 


40 


25 


09 


.25 


34 




47 


.43 


.60 


















( 002) 




.40 


26 


.48 


46 


61 


52 


22 




66 


83 


,38 


41 


.26 


.59 


66 


M 


18 


44 




.95 


53 


49 


42 


.59 


.77 


61 


36 


69 


90 





V 
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the EFL/ESP tests may be found in "Jable 2a. Table 2b Contains the to. ation 
ipatrix of technical scores. Table 3a contains the factor analysis and calculated 
loading of the EFL/ESP tests. Table 3b contains a comparison of predicted * 
correlations from the factor loading with the actual correlations. Table 3c con- 
tains the remaining vaiiunce not accounted for by the factpi loadings. u ,\ 
The correlations found in Table 2a reveal that there is a good deal of 
shared variance between the Total EFL and Total ESP tests (r = .88, r = .77). 
This high correlation indicates that there is little significance attributed to con- 
tent with- regard to a subject's score. It apr)(ears that EFL tests of one skill 
generally correlate more highly with ESP tests of the same skill than other tests 

TABL. 2b 

Correlation Matrix of Technical Scores 
Observed r above diagonal and Common 
Variance (r ) below diagonal. (N — 50) 





1 


2 


3 


4 


I Math 




.61 


60 


90 


2 DC Cir nits 


37 




.45 


.75 


3 Technical Draw 


•36 


20 




.85 


4 Total Tech 


81 


.56 


72 




p is h-ss than nol 











TABLE 3a 

I Principal Factor Solution with iterations) for the four 
EFL tests and four EST tests (N — 50 Algerian .subjects) 



Tests 



Loadings on C Factor* 



Michigan Grammar 

Listening 

Cloze EFL 

Dictation EFL 

Technical Grammar 

Reading Comprehension EST 

Owe EST 

Dictation JEST 



.83 69 

.73 53 

.73 ,53 

.72 .52 

.86 74 

63 .40 

82 67 

.80 .64 

Eigen value 4.72 



• AnottntH for \Q\) r /c of ♦}»♦■ total %ariam , e in the factor matrix 

II Number of Factors in Vaninax Rotation L»ss than two, so 
Rotation Bypassed 

TABLE 3b 

Correlation Matrix ( above the diagonal ) and Predicted Correlations Derived from 
Respective Pioducts of Loadings cn G (below diagonal). 





I ■ 


2 


1 Mich Gr-.rn 






2 Tech i.i miliar 


71 




3 Reading Comp 


52 


54 


EST 


• 




4 'Listening 


61 




5 Clo/e EFL 


.61 


.63 


6 Cloze EST 


38 


.71 


7 Dictation E*T 


66 


69 


8 Dictation EFL 


.60 . 


62 




6 N 

63 
.72 
.47 

.51 
69 

.66 
.59 



.62 
.61 
43 

64 
51 

,66 

58 



.oa- 

57 

30 

.44 

43 
.68 
.77 
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a ABLE 3c 



Residual Matrix with G Loading Partialed Out ( mean of 
absolute values = .0625 ) . Observed r minus Product of 
Loading on C. 



1 



1 Mich Gram .08 .0i 

m 2 Tech Grammar .04 
m 3 Reading Comp 
EST 

> 4 Listening 

5 Cloze EFL 

6 Ooze EST 

7 Dictation ~°T 

8 Dictation ErL . 



4 


5 


6 


7 


8 


.01 
-.06 
.17 


-.01 
.03 
.04 


-.05 
.01 
-.05 


-.04 
-.08 
-.07 


.02 
-.05 
-.15 




.00 


-.J9 
.09 


-.06 
-.07 
.00 


-.09 
-.10 
.09 
.19 



of different skills or content. The Michigan Grammar test correlates most closely 
with the Technical Grammar test (r^= .79). Listening Comprehension correlates 
highly with Dictation EST (r=.64). Cloze EFL correlates most highly with 
Cloze EST (r = .69). The test with the poorest correlations appears to be the 
ReadinfeComprehension-EST test, which also has a low reliability. 

The factor analysis reveals a single unitary factor accounting for 100% of 
the variance in the total matrix. All the variables may then be hypothesized 
as tests of general language ability. All variables load highly on the G facte/ 
with the exception of the Reading Comprehension test (h 2 = .40). The residual 
variance (Table 3c) indicates that only a small" amount of variance remains 
unaccounted for by thd G factor. 

The factor analysis does not reveal different skill areas with unique vari- 
ance. If this were the case, various factors would be produced, corresponding 
to a skill. In this study, the tests of grammar, listening and reading would 
produce three different factors, furthermore, if there were also a difference 
between language content and skill, there would be six factors: one for EFL 
Grammar, one for Tech Grammar, and so on. If the EST tests measured science 
and not language ability, then there would be two factors: one for science, 
containing the ESP tests and one for language, containing the EFL tests. The 
data, however, do not support aoy f of these conditions. Rather, they appear to 
support the unitary competence hypothesis (OUer and Hinofotis 1976; Scholz, 
Hendricks, et al. 1979) that "The components of language competence, what- 
ever they may be, may function more or less similarly in any language-based 
task." (Oiler and Hinofotis 1976:2) 

Jable 4a contains fr* correlations and F-ratios of the regression analysis of 
the English CELT tests dn the eight EFL/ESP tests. Table 4b contains the 
correlations and F-ratios of the regression analysis of the technical scores on the 
English tests. 

AU regressions of the English tests were significant at the .01 level with 
tne exception, of the CELT Structure Test on Cloze-EST (p <.05). Tests of 
grammar correlated most highly with the CELT Structure test, (Table 4a-l) 
although Michigan Grammar correlated slightly higher than the Tech Grammar. 
© sts of listening correlated most highly with the CELT Listening test, (Table 
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TABLE 4a 

Linear Regression Analysis of English Tests (N=50) 
4a- 1 Dependent Variable: CELT Structure 



Source 


r 


r 3 


Sum Squares- 


Sum Squares- 


F 


(in order of correlation) 






Regression (df= 


1) itesidual (df=48) 




i_ i^ucn orammar 


.69 


.48 


1006 


1326 


39.*J7* 


2 Tech Grammar 


.69 


.47 


1014 


1306 


37*7* 


3 Listening Comp 


.52 


.27 


504 


1818 


13.31 • 


4 Dictation EST 


.48 


.23 




1 QOA 

looU 


lo.ol 


5 Cloze EFL 


.46 


.21 


510 


1556 


15.73* 


6 Dictation EFL 


.45 


.20 


640 


1948 


15.77* 


7 Cloze EST 


.41 


.16 


185 


1374 


6.46* * 


8 Reading Comp EST 


.34 


.11 


065 


2216 


20.90* 


4a*2 Dependent Variable: 


CELT Listening 




\ 




Source 


r 


r* 


Sum Squares- 


Sum Squares- 


F* 


(in order of correlation) 






Regression ( df— 


1) Residua] (df=48) 





1 Listening Comp 

2 Dictation EST 

3 Dictation EFL 

4 Cloze EST 

5 Mich Grammar 

6 Tech Grammar 

7 Reading Comp EST 

8 Cloze EFL 



.68 
.63 
.62 
.52 
.50 
.48 
.43 
.40 



.47 
.40 

.39 
.28 
.25 
.23 
.18 
.16 



583 
456 
556 
651 
262 
^75 
4ul 
342 



784 
858 
860 
1056 
1040} 
1100 
1146 
1186 



35.69 
25.21 
31.03 
29.59 
12.09 
16.36 
16.80 
13.84 



k p li leu than .01 
* p it Uu than 05 



4a-3 Dependent Variable; CELT Total 


Source 


r 


r 2 


Sum Squares- 


Sum Squares- 


F* 


(in order of correlation) 






Regression • ( df= 


1) Residual (df=48) 




1 Mich Grammar 


.69 


.47 


3878 


. 3152 


59.06 


2 Tech Grammar 


.67 


.45 


2158 


3438 


30.13 


3 Listening Comp 


.66 


.44 


2829 


3392 


40.03 


4 Dictation EST 


.62 


.38 


2246 


3700 A ' 


29.14 


5 Dictation EFL 


.59 


.35 


2467 


3770 


31.41 


6 Cbze EST 


.52 


.27 


1441 


4372 


15.82" 


7 Cloze EFL 


,.49 


.24 


i040 


4616 


20.17 


8 Reading Comp EST 


.42 


.18 


952 


5204 


8.78 


* p is Um than .01 
















TA*LE 4b 






Linear Regression Analysis 


o 


onical Course Grades (N=50) 




4b- 1 Dependent Variable: 


Math 










Source 


r 


r 2 


Sum Squares- 


Sum Squares- 


F " 


(in order of correlation) 






Regression (df= 


1) Residual (df=48) 





1 Cloze EST 

2 Reading Comp EST 

3 Cloze EFL 

4 Dictation EFL 

5 Dictation EST 

6 Mich Grammar 

7 Tech Grammar 

8 Listening Comp 

+ tin than .01 



.23 


.05 


823 


14,764* 


2,65 


.09 


-+ 


-406 


15,926 




.07 


-+ 


194 


15,994 




.07 


-+ 


-250 


15,878 




.07 


-+ 


180 


15,338 




.06 


-+ 


69 


15,824 




.05 


-+ 


-57 


16,096 




.01 


-4 


108 


16,010 
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4a-2) although the EFL Listening Comprehension correlated slightly higher 
than Dictation-EST. In regard to the Total CELT scores, (Table 4a-3) both 
discrete-poiiit and integrative tests, regardless of EFL or ESP content, appeared 
to serve as indicators of a subject's future English ability. 

With respect to the regression analysis of .e technical scores on the 
eight English tests (Table 4b), the picture is not so clear. The CIoze-EST test 
correlates the highest of all the English tests with the Technical scores, with 
the exception of D-C Circuitry. However, the regression analysis was not sig- 
nificant at the .05 level. Cloze and dictation, integrative tests, served as slightly 
better indicators than the discrete-point tests of grammar. 

On*' problem with the regression analysis may be that the technical scores 
were not accurate representations of the subjects* technical performance. Other 
factors besides technical ability may have been taken into account in the final 
technical score calculations. The assumption that knowledge of English would 
have no bearing at all in an entirely English academic program seems tenuous. 

To explore further the relationship of EI?L/ESP to future technical per- 
formance the subjects with the highest scores in technology were examined. 
The criterion was that the subject had to have scored one standard deviation 
or better on three or more of the . technical scores. 1 Nine subjects met the 
criterion. The results may r^e found in Table 5. Although very few of the re- 
gressions are statistically significant at the .05 level, the correlations are much 
larger. Cloze-EST still maintains the highest correlation with the technical 
scores. Both grammar tests and the Listening Comprehension test do not appear 
to predict as well as some of the integrative tests of cloze and dictation. In 
terms of technical performance, it appears that in the case of the regression 
of all subjects (Table 4b-4) and of the top subjects (.Table 5d) Cloze-EST and 
Dictation-EFL are the best predictors of the eight English tests. 

S. Conclusions and Recommendations 

Tests containing either EFL or ESP measure general language ability. The 
ESP/EST tests used in this study assess language and not science. EFL tests 
are slightly better, indicators of a learner's l'uture EFL performance than ESP 
tests. 

In terms of predicting technical scores, more research is needed to deter- 
mine EFL/ESP tests that are able to indicate future technical performance at 
a statistically significant level. This study speculates that integrative tests may 
serve as better indicators of technical performance than discrete-point tests. An 
ijitegrative/pragmatic test in an EST contert is probably more valid than a 
discrete-point test, as "the validity of the test can be established not solely on 
the basis of whether it appears to involve a good sample of the English language 
but more on the basis of whether it predicts success in the learning tasks and 
social situations to which the examinees will he exposed, (emphasis mine)" 
(Carroll 1972:319)' 

1 One exception was Technical Drawing where the criterion was a score of 95 or better. 
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4b-2 Dependent Variable: D-C Circuitry 



Source 
(in order of correlation) 


r 


r* 


Sum Squares- 
Regression ( df= 


Sum So i m r#*Jt- 

1) Residual (df=48) 


F 


1 Urtening Comp 


—.25 


.06 


284 


4214 


3.23 


2 Cloze EST 


.07 


-+ 


256 


4438 


2.77 


3 Cloze EFL 


.06 


-+ 


143 


4466 


1.54 


4 Reading Comp EST 


-.03 


-+ 


-107 


4496 




5 Mich Grammar 


-.03 


-+ 


361 


4446 


3.00 


6 Tech Grammar 


-.03 


L 


361 


4454 


3.89 


7 Dictation EST 


-.03 


-+ 


478 


4418 


5.19" 


8 Dictation EFL 


-.02 


-+ 


1180 


4438 


12.76* 



4b-3 Dependent' Variable- 


Technical 


Drawing 








Source 


r 


r 2 


Sum Squares- 


Sum Squares- 


F 


(in order of correlation) 




Regression ( cL c — 


1) Residual <df=48) 




1 Cloze EST 


.31 


.09 


366 


12,284 


1.43 


2 Dictation EFL 


.27 


.07 


1526 


12,336 


5.94" 


3 Cloze EFL 


.20 


.04 


1400 


13,052 


5.15" 


4 Mich Grammar 


.18 


.03 


115 « 


12,900 




5 Dictation EST 


.11 


.01 


10 


20,558 




6 Tech Grammar 


.07 


-+ 


69 


13,526 




7 Reading Comp EST 


-.08 


-+ 


103 


13,446 




8 Listening Comp 


-.04 


-+ 


27 


13,360 





'pli lata than 
** p ii tea than 
+ laaa than .01 



.01 
.05 



4b-4 Dependent Variable: Total Technology 



Source 
(in order of correlation) 



r 


r J 


Sum Squares- 
Regression (df= 


Sum Squares- 
1) Residual (df=48) 


F 


.27 


.07 


1259 


62,814 


.96 


.15 


.02 


1195 


86,976 


.86 


.14 


.02 


1436 


. 67,198 


1.03 


.11 


.01 


1957 


68,446 


1.37 


.08 


-+ 


-1216 


68,232 




-.07 


-+ 


392 


68,210 




.05 


-+ 


605 


68,895 




-.0001 


-+ 


796 


69,068 





1 Cloze EST 

2 Dictation EFL 
Cloze EFL 
Mich Grammar 
Dictation EST 
Listening Comp 
Tech Grammar 



8 Reading Com p EST 

+ bat than .01 

TABLE 5 

Linear Regression Analysis of Technical Scores of Top Subjects 
in Technical Courses (N=9) 

5t Dependent Variable; Math 



Source 
(in order of correlation) 



1 Cloze EST 

2 Dictation EFL 

3 Reading Comp EST 

4 Dictation EST 

5 Tech Grammar 

6 Listening Comp 

7 Mich Grammar 

8 Cloze EFL 



r 


r* 


Sum Squares- 
Regression ( tlf= 


Sum Squares- 
1) Residual (df=48) 


F 


.61 


.37 


240 


428 


3.93 


.55 


.31 


202 


45? 


3.13 


.37 


.14 


249 


552 


3.16 


.31 


.10 


368 


574 


4.49 


.30 


.09 


52 


606 




.24 


.06 


34 


578 




-.23 


.05 


-123 


600 




-.005 


0 


312 


644 


3.39 



The integrative tests of EST-Cloze andTbictation-EFL may indicate that, 
in a technical context, listening and reading are significant language skills. It 
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9b Dependent Variable: D C Circuitry 



Source 
(In older of correlation) 

I 
Z 
3 
4 
5 
0 
7 



Sum Squares- Sum Squares- F 
Regression fdf=l) Residual (df=48) 



Close EST 
Ooze EFL 
Dictation EST 
Reading Comp EST 
Tech Grammar 
Mich Grammar 
Listening Comp 



8 Dictation EFL 



.63 
.58 
.58 
.56 
.36 
.31 
.30 
.06 



.39 
.34 
.31 
.31 
.13 
.10 
.09 
.03. 



50 
185 

-226 
151 
12 

-101 
276 



84 
102 
98 
98 
126 
128 
114 
150 



4.17 
12.70* 



8.38" 
1188* 



* » It Has taaa ,01 
** p to toss tkaa .05 

So Dependent Variable: Technical Drawing 



Source 
(In order of correlation) 



r t* Sum Squares* * Sum Squares- 
Regression (df=l) Residual (df=48) 



F 



1 Cloze EST 


-.70 


.49 


223 


34 


41.91* 


2 Tech Grammar 


-.63 


.39 


-101 


42 




3 Dictation EST 


-.54 


.29 


-211 


52 


26\40 # 


4 Listening Comp 


-.50 


.25 


205 


60 


23.62* 


5 Mich Grammar 


-.50 


.25 


-171 


52 




6 Dictation EFL 


-.22 


.05 


195 


, 70 


19*50* 


7 Oo~* EFL 


-.21 


.04 


4 


66 




8 Reading Comp EST 


-.05 


-+ 


-380 


66 




5d Dependent Variable: 


Total Technology 








Source 
(In order of correlation) 


.r . 


r 1 Sum Squares- Sum Squares- F 
Regression (df-1) Residual (df=48) 



Close £ST 
IWgding Comp EST 
Dictation EFL 
Dictation EST 



Mich Grammar 
Tech Grammar 
Ckne EFL 



.51 
.42 
.40 
.29 
.28 
-21 
.20 
.07 



J2B 
.18 
.16 
.09 
.08 
.05 
.04 
-+ 



159 
109 
89 
47 
-430 
-444 
-456 
5 



384 

512 

506 

564 

550 

582 

602. 

616 



2.90 
1.49 
1.25 



*p to toss Uu 
+ toss taaa. 01 



.01 



may be that while the learner may have to read his technical books, the tech- 
nical instructor simplifies his scientific information into everyday English. As 
Michael Collins of the EFL staff of the University of Petroleum and Minerals 
asserts after observing, the language of science lectures at his university, 

The need for communication forces the science teacher to explain difficult or un- 
familiar terms and concepts by. reference to everyday examples in everyday lan- 
guage and this is che kind of language he uses most of the time. ( Boyd and Boyd 
W8:*5). 

Needless to say, th? development of reliable and valid tests for learners 
in an ESP context remain*, necessary. It is hoped that the statistical analysis 
and ideas in this paper may help this development. 
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TECHNICAL GRAMMAR 



1. **You understand electronic theory very 
we!" 

"I ihould. I „_ it ever since I 

begin INELEC." 

a) haven't studied 

b) bid. studied 

c) have been studying 

d) had been studying 

2. The drawing must convey the informa- 
tion or the part will not be 

made correctly. 

a) accurate and complete 

b) accuracy 

c) accurately and completely 

d) as accurate and complete 

3. You 



what happens when 



safety precautions aren't followed. 



■ ) 

b) 

c) 
d) 



have ever seen 
have never seen 
are usual y seen 
can be seen 



4. The charge which might exist could be 

. positive or negative depending 

on which materia! gives up electrons more 
easily. 

a) more 

h) either \ 

c) neither 

d) such 

5. "Carefully inspect your circuit, checking 
the polarity of the leads/' He sai d - . . 

a) to inspect your circuit carefully 

b) inspect your circuit carerVfly 

c) you inspect your circuit carefully 

d) not inspect your circuit carefully 

6. "Don't touch that wire!" The wire is 
— dangerous to touch. 

a) much 
b> .very 

c) such 

d) too 

7. The word electronics derives from the 

electron. Electronics can Jefined 

to include all applications of electricity 
flowing in a vacuum. 

a) for 

b) be 

c) have 

d) been 

9 
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8. Ahmed 



seen a voltmeter be- 



fore he came to 1NELEC. 

a) had not 

b) has never 

c) hadn't been 

d) hasn't 

9. When working on printed circuits, too 

much heat — . soften the plastic 

form and cause damage. 

a) has 

b) can 

c) must 

d) has to 

10. Unless the student has a clear mental 

picture of a letter, he make the 

letter correctly. 

a ) should 

b) shouldn't 

c) can't 

d) can 

11. Lithium doesn't have six electrons in 

its second shell, and does 

beryllium. 

a) either 

b) also 

c) neither 

d) so 




glass rod supports a frame of 
holds up a very thin, 
filament. 



coiled wire, cal 

a) who 

b) whose 

c) that 

d) these 



13. Rosin-core flux is the solder most often 

used. Scientists, . though that fiax 

is not a substitute for cleaning the metals 
to be soldered. 



a) 

b) 
*c) 
d) 



have noted 
have been noted 
has noted 
have being noted 



14. The secret in soldering is to heat the 

joint, not the solder. When the Joint is 

, . to melt the solder, it forms a cover 

without any air spaces. 

a; too cold 
b) enough hot 



! ?5 
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TECHNICAL GRAMMAR (continued) 

c) hot enough 

d) very hot 

15. In todays mass production techniques, 

thousands of identical parts 

made from one drawing. 

a) should be 

b) can't be 

c) must be 

d) may be 

16. If the wire broken at any 

point, electrons would build up at the end 
of the wire that is connected to the nega- 
tive side of the battery. , 



») 

b) 
c) 
d) 



were 
is 

has 
had 



(Continued) 

TECHNICAL GRAMMAR (continued) 

the pointer would deflect in the reverse 
direction. 

a ) reverse 

b ) reversing 

c ) reverses 

d ) reversed 

22. The student put down 

the compass when the technical drawing 
teacher told him to stop. 

a) has already 
h) has Just 
v) had just 
d) has never 

23. "What kind of meter are you going to 

use?" 

"I'd like to use something — - the 

one you are using." 



17. "How can the current be measured?" a) that 

1>) like 

c) such 

d) as 



The instructor asked 



a) how the current could be measured 

b) if the current could be measured 

c) how could the current be measured 

d) if the current can be measured 

18. If two or more views are required, 
then the question of logical arrangement 

of the views arises. drawings 

have three views of the object. 

a) most 

b) least 

c) few of 

d) little 

19. Good drafting letters are well-propor- 
tioned. For enample, the cross bar of the 

letter A at one-third of the 

height of the letter. 

a) should be 

b) can't have 

c) -won't be 

d) might be 

20. The legibility of the drawing often 
how well the letters are made. 



a) 

b) 
c) 
d) 



decides 
is like 
depends on 
applies to 



ERLC 



21. The polarities on the meter terminal 
must be observed to obtain a positive meter 
reading. If the connections were — 




24. The mathematical proof 
orems of geometry are not 
portant to the draftsman 
application of these theorems *n practical 
problems. 

a) as - - as 
h ) more - — of 

c) most — of 

d) so . that 

25. You learned electronic theory- 

You are now ready to put it into practice. 

a) have not 

b) have yet 

c ) hadn't 

d) have already 

26. "Don't turn on the power before you 
close the switch/' The directions said - 



a) don't turn on the power 

h) turn on the power 

c) not to turn on the power 

d ) to not turn on the power 

27. If you were to bring two magnets to- 
gether with the north poles facing each 
other, you .„ - feel a force of re- 

pulsion between the poles. 

a) will 
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b) would 

c) won't 

d) had 

28. One student works hard in his lab 
class. The other student works. 

a) hard 

b) hardly 

c) completely 

d) doesn't 

29. Before the electric light bulb and its 
contents are sealed up tight, all the air 
~ out and a special gas put in. 

a) had been taken 

b) are taken 

c) is taken 

d) will have been taken 

30. The two types are acid flux and rosin 
flux. Acid flux is , — active in clean- 
ing metals than the other. 

a) least 

b) as 

c) more 

d) that 

31. "Are you going to put on your safety 
glasses? 1 ' 

He asked .. 

a) if were you going to put on your 
glasses 

b) where you are going to put on your 
glasses 

c) are you going to put on your glasses 

d) if you were going to put on your 
glasses 

32. The atoms of conductors have onb 1 

or 2 valence electrons. If you — 

at the atomic table, you can identify the 
good conductors. 

a) looked 

h) had looked 

c) look 

d) have been looking 

33. Neon is a naturally stable element, and 
is helium. 

a) so 

b) neither 

c) either 

d) such 

34. A certain type of ohmmeter must be 
zeroed each time you change the range. 

O 

ERLC 



When the adjustment deflect the 

pointer all the way to zero, it usually 
means the dry cells must be replaced. 

a) should not 
h) might 

c) cannot 

d) had better 

35. Recently man a new form 

of power, atomic power. 

a) areV>eing discovered 

b) had discovered 

c) has discovered 

d) has been discovered 

36. "Did you measure the resistance?" 

"No. I to do it during- the lab 

hour, but I didn't have time." 

a) hadn't planning 

b) had been planning 

c) have planned 

(1) had been planned. 

37. Voltage by using a voltmeter. 

a ) measures 
h) is measuring 
c) had measured 
<1) is measured 

38. A student would have difficulty mak- 
ing a letter correctly , a clear 

mental picture of the letter. 

a) if he has 

h) if he had 

c) unless he had 

d) if he doesn' have 

39. Atoms with only one valence electron 

are the best electrical conductors, 

copper is a good conductor. 

a ) because 

b) so 

c) as 

d) but 

40. The draftsman is — . important 
person that his mistakes cause his company 
to loose money. 

a) an 

b) an such 

c) such an 

d) so 

41. Electrons in the outei orbits of an 
atom are "free" electrons and may be 
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easily forced from their orbits. They are 

attracted to the nucleus by 

force than electrons whose orbits are near 
the nucleus. 

a) less 

b) lesser 

c) greater 

d) more 

42. When the big block of ice that 

Europe in the*fae age began to 

melt, forests began to appear. 

a) have covered 

b) had been? covered 

c) had being covered 

d) had covered 

43. The VOM is 'ised to measure voltage, 
resistance and current. It is called a Multi- 
meter it has multiple uses. 

a) while 

b) when 

c) so 

d) because 

44/ A little knowledge can be a dangerous 
thing and if something can go wrong, it 



a) will 

b) should 

c) had 

d) would 

45. Working with electr.uty is „ 

dangerous that the safety rules should be 
followed at all times. 

a) so 

b) very 

c) too 

d) such 

DICTATION 



46. Technical drawings that are not clear 

and accurate are difficult to read; , 

a good draftsman draws cleanly and ac- 
curately. 

a) when 

b) while 

c) because 

d ) therefore 

47. Thomas Edison invented the tight bulb 

in 1879. Since then, men many 

new electrical improvements, 

a) made 

b) have made 

c) is made 

d) have been made 

48. ^"he American system place the top 
view above the front view. The European 
system, does the opposite. It places the top 
view the front view. 

a) on 

b) below 

c ) over 

d) in 

49. "Why can't I use this meter?" 

"Because its that doesn't work 

now." 

a) the only one 

b) only one 

c) only of one 

d) only the one 

50. The directions say to draw an arc on 
either side of the triangle. The tirangle is 
the arcs. 

a ) around 

b) between 

c) right of 

d) left of 



EST 1. A volt is the unit of electrical pressure or potential./ Vt !tage is measured by using a 
voltmeter/ Voltmeters have a high internal resistance/ and are always corrected in parallel/ 
with a circuit or component such as a resistor/ The polarities marked on the meter terminals/ 
must be observed to obtain a positive meter reading./ If the connections are reversed/ fre 
pointer will deflect in the reverse direction./ The ampere is the unit of electric current./ 
Current is measured by using an ammeter./ Ammeters have low internal resistance/ and 
are always connected in series/ with a circuit or component such as a resistor./ 
£57 2. Potential difference is necessary/ to produce electrical current./ T«te number of free 
electrons that can be forced/ to drift through the wire to produce the moving charge/ de- 
pends upon the amount of potential difference across the wire./ With more applied voltage 
the forces of attraction/ and repukti a can make more free electrons drift/ producing mor 
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charge in motion./ A larger amour t of charge moving with the same speed/ means a higher 
value of current./ Less applied voltage across the sarr» " wire/ results m a smaller amount of 
charge in motion/ which is a smaller value of current./ With zero potential difference across 
the wire/ there is no current./ 

EFL 1. Every school child today knows how to tell the time./ It seems hard to believe that 
early man/ had no way to tell time./ In order to tell when things had happened/ men had 
to learn to measure the passing of time./ Men learned to measure a year by the time/ that 
it took for the earth to go through the seasons./ They learned to measure a day/ by the 
time it took the sun to travel across the sky./ Long after men could tell one day from an- 
other/ they could still not measure the time within a day./ Now we can measure our days in 
seconds, minutes and hours./ We have clocks to tell us the time of day./ We can measure 
our years in days, weeks and months./ We have calendars to tell us the time of year./ 
EFL 2. In cities in the United States/ there are clocks in most stores, factories and other 
buildings./ Radio announcers ?ive the correct time during the day./ People here think that 
it is important to know the time./ Most Americans have watches./ They want to do certain 
things at certain times./ They don't want to be late./ Time is not so important to people 
everywhere./ Suppose you visit a country m South America./ You would find that people 
living there do not like to rush./ If you had an appointment with someone/ the other per- 
son would probably be late./ He would not want to arrive on time./ In South America/ 
even the radio programs may not begin right on time./ Nor do the men on the radio think 
it important to announce the exact time./ In South America many people think of a clock 
as a machine./ They feel that a person who does everything on time/ lets a clock run his 
life./ They don't want a clock or any machine/ to have that much power over their lives./ 

CLOZE 

EST I. The VOM ( Volt-Ohm-Millimeter) is the instrument used most often by the tech 
nician. It is used to measure voltage, resistance, and current; and because of this multi- 
usage is also da MultimeteL In electronic shops and _ * industry, you will hear 

it called VOM or Multimeter. 

It is important - the student of electronics learn this - - and how to use it 

correctly safely. What bread and water are the human, the VOM is to 

technician. You must learn to read — scales, where to connect the meter 

, how to position the f range and switches, and how to measure voltage, 

and current. 

Voltage is an electromotive _. It is the force from a or generator that 

drives our radios . electrical appliance. Voltage is the force — . causes current 

to flow. Voltage is in volts. 

Current is the movement electrons through the wire. This movement 

forced by the voltage. Current is - ~ in amperes or milli-amperes. 

Resistance the opposition to current flow. It measured in ohms. 

EST 2. Two cases of zero potential difference and no current can be considered in older to 
emphasize the important fact that potential difference is needed to produce current. Assume 

a copper wire to be — itself, not connected to any voltage , so that there 

is no potential across the wire. The free electrons _ _ - the wire can move 

from atom atom, but this motion is random, — any organized drift through 

the wire. — the wire is considered as a - — , from one end to the other^-r^^-^- 

current is zero. As another example, ______ that the two ends of .he have the 

same potential. Th^n free _ cannot move to either end, because - ends have 

the same force, and is no current through the wire. practical example of 

this case of . potential difference would be connecting both of the wire 

to just one . of a battery. Each end of wire would have the same po- 
tential - - there would be no current. The _ , therefore, is that two connec- 
tions are . to two points at differ^ potentials order to produce current. 

EST 3. The." aie many kinds of tools. . _2 — useful kind is the screwdriver. There __ — — 

two types of screwdriver but both have the same purpose: to tighten „ 

loosen screws. The type used for - or loosening Phillips-head screws is the 
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CLOZE (continued) 

Phillips screwdriver. The one which and loosens standard screws is the 

screwdriver. No matter which type is , the process is the same. It _ _ very 

important to select the correct size for the particular j'oh you ____ are going 

to do. The screwdriver must - , the screwhead exactly. If the screwdriver is 

* too small or too large . . _ the head of the screw it . probably damage it. 

Once the correct size has been selected, the screw . ,_ be tightened by turn- 
ing it clockwise. loosen the screw, on the other . - turn it in a counter- 
clockwise _ 

Wrenches, another type of tool, come - - a variety of shapes and sizes. screw- 
drivers, wrenches are used to make- __ tighter or looser. They are used - _ 

tightening, nuts and bolts. One of _. most important types of wrench is at 

both ends and is called open-end wrench for that very reason. either 

end of this kind of may be put to use, it said to l>e reversible. The box- 
end almost always has two sides too. the open-end type, it is reversible — 

is a useful end on either of the handle. 

EFL i. ¥fom here to there 

Long, long ago, there was no way to tell time. There were no clocks. There were . . ~ 

calendars. There were no rulers, either. did not think about how long how 

wide anything was until they to own land. Then, many kinds measure- 

ments were needed. 

To own land, man had to know where his _ ended and someone else/ 

began. To . a house, he had to know big to make it. To trade 

goods for things he wanted, he to know how much his things worth. To 

know when to plant crops, he had to have some of telling time. 

Little by little, __ learned to measure to find out _ things he needed to know. 

EFL 2, Half Go Hungry 

There are over two billion people in the world. Each day, around 324,000 babies are _ , 

and the world's population grows larger. the present time, there is not 

food for everyone. More than half world's population does not get enougli - - - 

the right kinds of food to _____ healthy. About 10,000 people in the ______ die of 

hunger every day. 

Trying _ feed the world's billions is a task. It is a job too _ for 

some nations working alone. It - ~ . a job for many nations of _ world working 

together. With the help . . modern science, tools, and machinery, many can 

lie helped to grow more 

Over sixty countries have joined together .___ form the Food and Agriculture Organi- 

lation -_~ the United Nations. These nations work to fight hunger. 

The Food and . _ Organization searches for new and better to grow food. 

It helps fight ___ . and plant diseases. It sells seed farmers at low cost. It 

works _ turn deserts and waste lands into lands. It helps to improve the 

. — of the world's peoples. It sends - of agriculture into those countries that 

help. 

EFL 3. Science and Technology 

There is little doubt that science and technology are closely related. In fact, technology is 

based on ________ knowledge. Scientists are coastantly discovering new „ and 

gathering additional information about matter __ . energy. Putting these principles and 

information - use is technology. 

Although science and are usually closely related, improvements in __ - - do 

not necessarily require a great . — of scientific knowledge. In ancient times, . _. 

example, the wheel was used. The - - of the wheel was a great advance. 

Pulleys and windmills were invented used long before the term scientist - 

into use. 

Modern technology seems to getting more and more dependent on - - sci- 
entists. Scientists have been the key _ _ . . _ in several examples of modern advances - 
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CLOZE (continued) 



technology. For example, computers, atomic reactors, — lasers were developed by 

terns of 

*. Reeding Comprehension EST 
The human ear can hear a wide range of sounds. This range is measured in units called 
decibels. The sound of a whisper has a decibel level of 20, and the sound of conversation has 
a decibel level of 60, and the sound of an airplane is about 110 decibels. Sound becomes 
noise (and can begin to damage your hearing) when it goes above 86 decibels. Of course 
sounds below 86 can be called noise when they interrupt other sounds and are unwanted. 
Doctors have foe d that electric guitar* and other amplified instruments may produce decibel 
levels from 90 to 105. These levels, according to some doctors, can begin to cause deafness 
in the musicians and their listeners. 

L Which of these does no* define noise? 

1. a sound above 86 decibels 

2. sound that is unwanted 

3. a sound that interrupts other sounds 

4. a sound with a wide range 

2. What does "unit" mean in this passage? 

1. an amplified guitar 

2. a fixed amount 

3. a noise 

4. an inch 

3. If something had a decibel level of 60 it would not be: 

1. a sound 

2. a noise 

3. a cause of deafness 

4. unwanted 

4. The decibel Ufc^at which a sound can U> called a noise is: 

1. not exact and absolute 

2. determined by doctors 

3. 105 decibels 

4. the decibel level of airplanes 

A phonograph is a thin, flat plastic disk in the shape of a circle. Sound is recorded on a 
phonograph record in one long spiral groove. Sound results from the movement of waves in 
air. These movements, or vibrations, direct a sharp knife-like cutter that scratches a wavy 
groove into the plastic disk. The cutter moves in a continuous path around the disk from the 
edge to the center. The patterns of the waves in the groove change as the pitch and loudness 
of the sound change. When the disk is played on a phonograph at the same speed at which 
the sound was recorded, the waves in the groove make tlie phonograph needle and arm 
vibrate. An amplifier is used to make these vibrations strong enough to hear as sound. 

5. A phonograph needs all of the following to make sound except: 

1. The record moving at the same speed that the cutter moved 

2. an amplifier 

3. f| knife 

4. ^ir 

6. If the patterns of the waves in the groove do not change, the sound on the record will: 

t £ be louder and higher than normal 

* ' not change 

/ 3. not be heard without an amplifier 

4. need a sharper phonograph needle to make the sound change 
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Reading Comprehension EST (continued) 

7. The word "spiral" in this passage means: 

1. circles inside each other 

2. an unending line 

3. a cut 

4. a path that circles around in smaller and smaller circles like a coil 

8. The function of the amplifier is to: 

1. create vibrations 

2. increase the strength of the vibrations 

3. change the groove pattern 

4. make the disk stronger 

Matches are called safety matches when they can be lit only by rubbing them against the 
stilling surface on the outside of the box in which they are packaged. The match -lights" 
because there is a reaction between the chemicals in the head of the match and the chemicals 
on the striking surface of the box. 

The composition of the match head includes a chemical that carries oxygen, such as lead 
oxide; a chemical that is inflammable, such as sulfur; and substances that cause friction, 
such as powdered glass. Glut-like elements bind everything together. 
When the head of the match is rubbed against the striking surface, friction causes heat on 
a small area of the head. The heat frees the or m from the chemical carrying it, and the 
oxygen Joins with the sulfur to form sulfur die This causes more heat, which in turn 
causes more oxygen to be freed and to be combined with sulfur to make more sulfur dioxide. 
The chemical process is so fast that the match appears to catch fire all at once. 

9. A safety match needs all of the following to light except: 

1. a chemical that carries oxygen 

2. coloring matter 

3. v the striking surface of the box 

4. friction 

10. What does "composition" mean in this passage? 

X a written page * 

2. a combination of things 

3. the shape of the match head 
. 4. powdered glass 

11. What does "striking surface" mean in this passage? 

1. the box 

2. the outside part of the box 

3. cardboard 

4. the label 

12. The purpose of this passage is to: 

1. describe how a safety match works 

2. warn against unsafe matches 

3. list the chemicals in safety matches 

4. describe how oxygen combines with sulfur 

During a radio broadcast or tape playback, , the level meter indicates the condition of the 
batteries. If the pointer is in the GOOD range, the voltage <s sufficient. If it is in any other 
position, all the batteries should be replaced with new ones. If the batteries are weak, slow 
tape speed, sound distortion or low volume will occur. 

13. The level meter operates only: 

1. if the batteries are weak 

2. when the set is on 
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ft«*oW Comprehension EST (continued) 

3.* when the pointer indicates GOOD 
• 4. when there if sound distortion 

14. Sow tape speed and sound distortion are the result of: 

1. low volume 

2. sufficient voltage 

3. tape playback 

4. battery weakness 

15. The condition of the butteries c-n best be determined by: 

1. / the range the LEVEL meter pointer is in N 

2. volume ^ 

3. visual inspection of them 

4. distortion * ' \ 

16. If the pointer is outside the GOOD range: * 

1. voltage is sufficient 

2. tape speed will improve 

3. batteries should be replaced 

4. the batteries are okay 

Drills for* wood or metal differ slightly in their shape, but their mode of action is almost 
identical, whetner operated by hand or electrically powered. The wood drill (or bit X has a 
screw-like part at its center which drives into the wood and pulls the cutting edges after it. 
The cutting section approaches the wood with two sharply pointed knife edges that spear 
straight down into the wood and cut through the fibers in the cross grains so that they won t 
*i .inter. The cutting edges themselves, two knives which turn and slant into the \* r x£ clean 
out all the wood between the spearing blades and carry it up along the twisted path on the 
body of the drill 

17. Which part of the bit enters the wood first? 

1. the cutting edges followed by the screw-like part 

2. the drive 

3. the screw-like part at the center 

4. the knife edge 

18. Wood drilk are powered: 4 

1. always by hand 

2. always by electricity 

3. sometimes hy hand, sometimes electricity , 

4. bythe bit 

19. Wood and metal drills are: 

1. different in shape and mode of action , 

2. a little different in shape but much the same in action 

3. different in action but identical in shape, 
, 4. exactly the sai** <» in shape and action 

20. The passage is: 

1. about wood drills and metal drills equally 

2. mainly about metal drills 

3. is concerned with all types of drills 

4. is mainly about wood drills 

Metal drills oWt need the spearlike parts since metal has no grain and it will be cut 
smoothly just by the sharpened drill end. Usually die metal drill scrapes away the material 
at the very center of the hole and cuts it away out toward the periphery. It is driven into 
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Reading Comprehension E§T (continued) 

the metal by very high pressures and speeds in the drill press or even by hand-held electric 
drills. A hole larger than % inch is usually made by first drilling a smaller hole and then 
mjUiing with a drill of the desired hole size. # 

21. Drilling a heW larger than Va inch ki metal requires: 

1. spearlike parts 

2. the use of more than one drill bit 

3. a special drill press 

4. no grain * ' r - 

22. The spearlike parts found on wood drill are: 

1. unusual on metal drills , 

% 2. characteristic of metal drills 

3. not a part of petal drills 

4. needed on all metaj drills > 

23. Is there any reference made to wood drills in this passage? _ 

1. Yes, indirectly, in sentence number one 

2. -No, there are no references at all 

3. Yes, there are references made|throughout 

4. Yes, there is a reference made in the last sentence 

24. One function of a drill besides simply drilling is: 

1. determining a hole size 

2. measuring 

3. waste removal while drilling 

4. avoiding high pressure 

The important thing to remember when wiefding a hammer is that the speed of the hammer 
head, not your muscle, is what gives maximum impact. It is more important, to get a good 
stroke well placed than it is to get a roundhouse blow. Try to strike the nail witb the hammer 
handle parallel to the work at impact.* This means Ihr-i your hand will be just a little farther 
away from the surface of the wall ^or- cabinet than you would normally think is right. But 
by keeping the hammer face flat on the nafl head on impact, the nail will drive true and 
straight rather than off to one sider-and there's no wasted power. Also try to keep in mind, 
that it is the head of the hammer doing the work, not the arm. Give it a good rotating swing 
from the elbow, not necessarily from the shoulder. 

25. The best distance between the hand and the surface being worked on is. 

j 

1. twice the length of the hammer 

2. less than most people think 

3. as near as possible 

4. more than most people •ni^k 

26. r To use a hammer with greatest efficiency 

1 a good roundhouse blow is important 

2. your hand should be close to the work surface 

3. speed and .accuracy of stroke are important 

4. maximum muscle power is most important 

27. Which of the following does not lead to wasted power when hammering? 

1. depending upon muscle strength 

2. keeping the hand very close to the object !>eing hammered 

3. use of roundhouse blows 

4 use of a rotating swing from the elbow 
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Reading Comprehension EST (continued) 

88. Which of the following does not contribute to maximum hammer impact? 

1. ■el e cti on of the heaviest hammer available 
% hammer head speed 

3. keeping the hammer parallel to the work at impact 

4. greater distance from the surface than you would iiormaUy think correct 

A cell produces electricity by chemical meant. That's the basic definition. The many available 
typos have been divided into general kinds— primary and secondary. Those cells that are not 
rechargeable (flashlight D cells, for example) are primary cells, and those that may be re- 
charged (automobile batteries) are secondary cells. The primary cell is disposable. It is so 
inexpensive that you can afford to throw it away and buy a new one when the old one has 
been drained. The secondary cell is rechargeable. When drained of current, it can be re- 
charged by applying an external voltage supply to the terminals and reversing the chemical 
action that took place during discharge. 

29. Cells that are not rechargeable are called: 

L primary cells 

2. flashlight cells 

3. secondary cells 

4. automobile batteries 

30. Secondary cells are: ^~ 

1. non-rechargeable and expensive 

2. expensive and rechargeable ' 
X inexpensive and rechargeable 

4. inexpensive and non-rechargeable 

3L This paragraph is concerned with a comparison between; 

1. D cells and other flashlight batteries 

2. expensive and inexpensive types of batteries 

3. primary and secondary cells 

4. automobile and flashlight batteries 

32. Which of the following adjectives describes an automobile battery? 

1. secondary, non-rechargeable 

2. primary, rechargeable 

3. rechargeable, secondary ■ 



4. non-rechargeable, primary 
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Surveys of graduate programs reveal that few elementary teachers receive 
certification or degrees in TESLTEFL. It is more common for ESL teachers 
at this level to be recruited from regular classroom assignments. Elementary 
teachers, therefore, arc a vital audience for in-service training in language 
pedagogy. 

This paper discusses some sociolinguistic, sociopolitical, and attitudinal 
factors that appear to contribute to perpetuating a second-class status for 
elementary-level ESL teachers relative to their peers in academic or credential 
programs that are meant to train them, in the schools and society in which 
they work, and in professional organizations such as 1 ESOL where their par- 
ticipation is often less than that of other special interest groups. 

Drawing on their experiences as teachers and trainers, the authors re- 
examine some assumptions about course content and goals specific to this 
audience. They indicate the need to resolve the dilemma about whether to 
provide short-term training programs in response to teachers' expressed need 
for immediate, tangible, and practical help, or long-range education involving 
theoretical bases of language pedagogy designed to prepare trainees to function 
independently, 

Since its inception in the mid-sixties, TESOL has been concerned with 
providing a common meeting ground for practitioners from every instructional 
level — elementary, secondary, college/university, and adult. By avoiding arti- 
ficial divisions, this professional organization has emphasized its focus on the 
whole field \>l English language teaching. However, as the field has grown, so 
has awareness of the variability among learners as well as of teaching circum- 
stances. English language teaching in 1979 is not uniformly the same wherever 
it is practiced. Thus, any consideration of the current state of teacher training 
must be stated in terms of a particular educational context or level. 

Our interest here is in elementary- level public school teachers. How do 
they become ESL teachers? How are they prepared for the job? Our concern 
is that, like Cinderella, they appear to find themselves the «te F - sisters of their 
peers — it. academic degree or credential programs that are meant to train them, 
in the schools and society in winch they work, and in professional organizations 
such as TESOL. For example, compare the number of presentations at recent 
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TESOL Conventions which come directly from elementary ESL classroom ex- 
perience with that from any other level Out of almost 250 papers, panels, and 
workshops on the program of the 1978 TESOL Convention 4n Mexico City, 
there were only ten presentations directly concerned with the elementary level 
and most of these were in the context of bilingual education. Similarly, of the 
nineteen articles on teicher training listed in the TESOL Quarterly Index for 
Volumes I-r-X, none are specifically concerned with elementary level teachers. 

What we see is a classic example of a self -perpetuating cycle: lack of solid 
professional preparation, lack of status and confidence as practitioners, lack of 
incentive or recognition for participation in academic activities or professional 
organizations, and lack of adequate representation, appeal, or opportunities for 
personal and professional growth at the institutional or organizational level. 

The purpose of this paper is to explore the background of this situation. 
In so doing, we wish to share with other trainers, potential trainers, and elemen- 
tary classroom teachers themselves (the Cinderellas), our experience as trainers 
of elementary-level ESL teachers. Our base for viewing this neglected audience 
in TESOL combines two decades of planning, organizing, and conducting 
courses 1 r teachers, both as in-service classes and workshops within a school 
district, and as continuing education courses through a university. We also 
wish to validate our views with those of people doing similar work in other 
parts of the country. 

What sets elementary teachers of ESL apart from their peers in other 
levels of instruction? Let us begin with the selection process. Most elementary 
ESL teachers come from the ranks of classroom teachers. Generally, they have 
had a regular teacher-training program organized around the traditional subjects 
of math, social studies, reading, and language arts. Language arts preparation 
is quite different from the content of TESL-based academic courses such as in- 
troduction to linguistics, or second language acquisition, in that it assumes that 
students are native speakers of the English language. 

Prospective ESL teachers may be selected, they may volunteer for the 
job, or they may simply have a number of limited or non-English speaking stu- 
dents in their regular classroom. The basis for being singled out by school 
administrators is often because they speak the first language of the majority 
of the ESL student population, because they have an "ethnic" or immigrant 
backgrou nselves, or because they express a special interest or concern. 

The mot. cor volunteering may include a desire to get out of the drudgery 
of a reguiai assignment, a wish to be different, to have a special assignment 
or title, or a readiness to gain new skills and experiences. Rarely do elementary 
teachers enter ESL teaching at that level because of prior training or experience 
in TESL. 

Occasionally, we encounter a "Pollyana" response which maintains that 
teaching ESL is like teaching any ordinary class: there is no need to learn a 
new job, regular methods and materials will do. We also sometimes find a 
patronizing attitude that understanding or doing something for the poor children 
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is enough. Fortunately, many recognize the need for help and seek it. Whether 
desparately struggling for their own survival in a new and difficult situation, or 
altruistically seeking the best way to meet the special needs of ESL students, 
teachers who come for on-the-job training through in-service or extension classes 
are making a realistic response to a new challenge. It is our responsibility as 
trainers to recognize their needs and to help them make the transition. But 
unlike fairy-tales, there is no magic wand. * 

We must realize that this is a different population from those having 
certificates or graduate degrees in TESL. Our typical Cinderellas probably have 
•a richer background in classroom management techniques and curriculum in- 
novation (Taylor, 1978). They understand the complexities of the total elemen- 
tary aehoo] instructional program. They understand the intricacies of organizing 
independent activities. They know how to get students to work in small groups. 
They are familiar with arranging learning centers and are concerned with 
making the classroom environment exciting and stimulating. They think in terms 
of realia and visuals. In fact, many graduates from TESL training/degree pro- 
grams, welt-versed as they are in language acquisition theory, could probably 
benefit from such expertise in these other areas. 

This, then, is the usual background of the people who come for in-service 
training or continuing education. When we see them in the role of students, 
they manifest certain characteristics which adu up to a composite picture of 
insecurity. Teachers often come as a group from their particular site, looking 
to one another for mutual support. Their expectations are set at picking up 
samples of everything. They freely express their need for help, but feel it should 
be in the form of something they can put into their hands rather than into their 
heads. They want quick recipes for what to do tomorrow, in the form of hands- 
on materials such as games and visuals. They often evaluate an in-service 
workshop in terms of how much they can take back and use themselves or 
put into a learning center for their students ft> use. We do not intend this to be 
a negative characterization. In fact, we see the give-me-one-of-everything re- 
action as a very normal and understandable response to the frustrations and 
realization of the enormity of their task. Our intent is to make trainers aware 
of the teachers' need for immediate, practical, tangible help in '#vercoming 
their sense of inadequacy. 

In addition to these personal concerns, there are complex external factors 
contributing to elementary teachers' feelings of insecurity about being ESL 
teachers. There are any number of conflicting attitudes and opinions held by 
other teachers as well as the general public on the subject of "language use and 
language learning in the public schools, some of which are enumerated below: 

1 ) Why is ESL such a big issue? Haven't waves of immigrants throughout 
the history of this country learned English as a matter of course in the public 
schools? 

2) What's wrong with the natural approach to language learning? Doesn't 
it happen by exposure, by osmosis? Why is there a need for directed instruction? 
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3) Isn't interfering with children's language damaging to their self-image? 
Why should we impose English on the child's natural growth and development? 

4) How do we reconcile the recognition of students' right to their own 
language with the reality that they need English to achie\e minimum com- 
petence for graduation and success in securing jobs? 

5) Do. the language needs of limited-English speakers differ from those 
of non-standard English speakers J Should they receive the same type of lan- 
guage instruction? 

6) Does pulling out children for ESL class or to a language lab have a 
damaging effect by tracking, isolating, or labelling them? What are the alterna- 
tives, and what are the relative merits of different ways of grouping for ESL 
instruction? 

Trainers who are prepared to address themselves t< sociolinguistic ques- 
tions such as these early in their program can usually establish a higher degree 
of mutual understanding and respect with an elementary-level audience. Train- 
ers must also be prepared *o dval with socio-political issues concerning the role 
of ESL in the context of bilingual education such as: 

1 ) Advocacy or polarization of bilingual vs. ESL instruction and instruc- 
tors, as if one negated or precluded the other, 

2) An assumption on the part of some school boards and administrators 
that ar.yone who speaks English can automatically teach it. This ignores that 
fact that though they do read or use math in their daily Jives, teachers have had 
to undergo training and qualify for credentials in order to teach these subjects. 

3) Bilingual teacher certification and programmatic requirements specify 
fluency and literacy in the students' language and knowledge o£ the culture 
associated!* with that language but tend to omit any standards for competence 
m English, or knowledge of how language is learned or taught. 

4) Similarly, and possibly consequently, recruitment and in-service efforts 
have developed the' same emphases and thus virtually ignore the whole popula- 
tion of monolingual or other teachers who could and should be identified or 
trained to teach ESL. 

5) Staff development activities are often devoted to explanations of the in- 
tricacies of lega^ requirements and criteria for compliance with state and federal 
mandates but seldom spell out how these goals are to be carried out in the 
classroom. At best, they are prescriptive or proscriptive, but gi>^ the teacher 
little choice or direction, individual practitioners have to figure out how to do 
it themselves. 

6) Then is ambiguity and ambivalence about where ESL belongs — 
whether it is considered a part of the bilingual program or a legitimate part 
of the curriculum with a content, scope and sequence of its own. ESL instruc- 
tion competes for dollars, stall, materials, and space with primary language 
instruction in bilingual programs on the one hand, and conventional instruction 
in basic skills on the other. 
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7) Teachers feel fnistration when they have insufficient time on a regular 
daily basis, or for a duration of longer than one .school yea , to give students 
enough specific ESL instruction to see concrete results. The absence of demon- 
strable positive outcomes leaves teachers with feelings of dissatisfaction about 
their jobs, and concerns about keeping them. 

Thus, we sec that ESL teachers in the context of bilingual education may 
feel resentful or cynical. They sense that the scene is charged with as many 
emotional and political questions as pedagogical ones. They see themselves in 
a second-class status in relation to other program instructors, licensing and cer- 
tification, instructional priorities, and job security. Given these and other un- 
resolved issues, is there any wonder that new and prospective elementary- level 
ESL teachers aproach their tasks tentatively? 

And given this complex array of factors affecting instructional concerns, is 
it any wonder that training programs originating in academia and operating 
only on a theoretical or abstract level usually fail to attract or hold an audience 
of elementary-level ESL teachers? An understanding of what is appropriate for 
this audience is the trainers' real dilemma and challenge. Experienced trainers 
recognize that there are no simple, clear-cut answers to the many questions in 
the field, yet they must make decisions about their own roles and goals. 

The trainers' dilemma over whether to start with theory or application was 
faced some years ago when Peace Corps TESL TEFL programs were in their 
prime. Campbell (1967) discussed the distinction between long-and short-range 
programs, and between "education" and "training*. He pointed out that if 
trainees understand basic principles of language and language pedagogy, then 
they are able to answer their own questions. However, this kind of understand- 
ing takes considerable time to acquire; trainees still need something to use 
tomorrow. 

How do we reconcile the needs tor long-range education vs. short-term 
training mi working with, elementary-level teachers? There are two main areas 
of consideration: assumptions about course content, and attitudinal factors. 
Many conscientious trainers believe that their uitimate goal should be to get 
trainees to the point of "answering their own questions and making their- own- 
decisions. But they automatically assume that the first step, or lesson one, should 
be a lecture on the nature of language, followed by assigned reading and 
further lectures designed to provide in-depth background in the fields of second 
language acquisition, sociolinguistics, grammatical theory, and on issues of lan- 
guage pedagogy, cross cultural awareness, and curriculum planning. Even when 
such courses begin with theory and promise to get to application later, the result 
can be alienation, hostility, or no return audience next class or next term. 

In practically every group of trainees, there are some individuals who are 
stimulated by an abstract approach and who respond positively rather than 
being bored or antagonized. However, back in the classroom, they run the risk 
of even greater disappointment if their increased awareness is not tied to prac- 
tical concerns. If. on the other hand, trainers try to be responsive to teachers' 
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needs by providing them only with survival techniques and materials for im- 
mediate classroom use. are they being responsible or responsible for simply 
perpetuating teachers' dependence on hand-outs? 

Trainers should examine themselves for latent or implicit signs of con- 
descension when addressing teachers "down" at the elementary level. Often 
unconsciously, they express the attitude that the higher the instructioral level, 
the higher the intellectual capabilities of the audience. Tiainers who talk over 
the heads of, or down to, elementary teachers show some striking parallels to 
inexperienced teachers of ESL who often speak too distinctly, loudly, or slowly 
to their students. 

Trainers should be honest about their own background and experience. 
If they have had little or no exposure to the realities of the trainees' jobs, in 
terms of locations, organization, time constraints, or programmatic requirements, 
they are at a disadvantage. However, if trainers are working without first-hand 
knowledge of the trainees' circumstances, it is best to be straightforward about 
the situation and ask the trainees to supply necessary background information. 
Sensitive trainers should come away from this experience feeling that they have 
learned as much from the participants as the other way around. 

Trainers need to know if they are facing a captive or a self-selected 
audience. They should also differentiate between an audience composed of 
novices as opposed to more experienced people. But no matter how diverse 
the audience, trainers can usually count on there being some who want how-to- 
do-it formulas, while others are looking for more general background and basic 
principles. Both groups should he provided for. The challenge for trainers is 
to come qp witnuiings that trainees can learn from themselves as well as use 
with their students. m 

Trainers need to come out of the ivory tower. Successful programs draw 
on practical examples which relate to the trainees' own experiences. Here again, 
a parallel can be drawn with good ESL instruction that goes beyond frontal 
teaching and empty drills to involve students in realistic and meaningful con- 
texts and situation?, In this connection too, trainers need to decide if and to 
wliat extent they should function as a sounding-board for teachers anxieties 
and socio-political concerns. 

In sum, the best kind oT training program should provide a model itself. 
"The medium is the message" aptly applies to teacher training. The processes- 
through which training takes place can |je used as examples of how to teach 
language to elementary school children. 

A vital goal in such |*fraining program for elementary-level teachers is to 
help eradicate the Cinderella complex. The teacher who possesses confidence 
that comes from appropriate training and successful experiences is ready to 
enter into professional activities. We hope that at next year's TESOL Conven- 
tion, we will see more presentations directed to this audience, and more elemen- 
tary teachers trying on "Cinderella's slipper" and addressing themselves to issues 
of their neglected level. 
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The growing demand for ESL teachers, caused both by legislatively mandated 
classes for United States residents of limited English-speaking ability and by 
the influx of foreign students into the country, has resulted in the addition of 
ESL components to tracher training curricula in many colleges and univer- 
sities. Often these ESL components function, under the tutelage of already 
existing curricula in Foreign Language or in English, where ESL becomes 
something of an academic stepchild, with similarities stressed and differences 
ignored. Yet ESL students in monolingual, multicultural programs are quite 
different from students of English as a first language, foreign language students, 
or ESL students in bilingual classes. Nor arc the monolingual ESL programs 
themselves the same as those in English as «i first language, foreign language, 
or bilingual education. The dissimilarities include, among others, amount and 
sources of funding, student motivation, geographic setting, ages of students, 
levels and* orientation of languages taught, and cultuial emphases. Because 
of the growing demand for ESL teachers, and in view of the fundamental 
differences between ESL programs and other language programs, this paper 
argues for the development of ESL teacher training programs unaffiliated with 
English, or foreign language, or bilingual education, and for state certification 
of ESL teachers, in order that ESL teacher training may be expanded into the 
complete curriculum that it deserves to l>e. 

Programs for the education of ail language teachers clearly share many 
areas of preparation. The overlapping of subject matter, materials, and aims has 
long made the fields of teaching English to Speakers of Other Languages, teach- 
ing the various foreign languages, and teaching in bilingual programs inter- 
dependent from the viewpoint of research and cooperation. Much of the recent 
literature cites common ground and the sharing of common concerns among 
our disciplines (Alatis 1976b; Kunkle 1977; Finocchiaro and Ekstrand 1977). 
While we endorse all pt).v.sible~cxxip?ratiori among our inter related-fiekfe, we 
feel that there is a growing need for English as a Second Language (ESL) 1 
teacher preparation curricula that are separate and independent from those for 
bilingual education teachers, those for foreign language teachers, or those for 
teachers of English as a first language. 

* We are aware of the technical differences implied in the designations English as a 
Second Language (ESL), English a< a Foieign Language (EFL), and the generic English 
for Speakers of Other Languages (KSOL). In this paper we have chosen to use ESL generi- 
cally because it is the more popular designation with most educators, and because it is ap- 
propriate with resp* a to order of language acquisition. 
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Our premise is that students of ESL who are not enrolled in bilingual 
education programs are different from those who are and different also from 
typical American foreign language students. Further, these ESL students are 
the fastest growing body of language students in the U. S. today. Our con- 
clusion is, therefore, that these students deserve to be taught by language 
teachers thoroughly prepared in programs that, while sharing similar aspects 
with bilingual education, foreign language education, and English education^ 
are, for the most part, independent of these, just as these curricula are in- 
dependent of each other. It follows, then, that such teacher candidates deserve 
in turn to be prepared in programs directed and taught by those thoroughly 
versed in all aspects of TE.SOL. 

Three main groups oc ESL students not enrolled in bilingual programs 
can be identified: foreign students who study ESk in our colleges and uni- 
versities; ESL students m Adult Basic Education program! and elementary 
and secondary students emoHed in Multicultural or ESL programs. A quick 
look at the numbers involved will give an estimation of the need for profession- 
ally trained ESL teachers. 

Concerning the first group, the annual survey of the Institute of Interna- 
tional Education reports that during the 1977-78 academic year there were 
235,-509 foreign students on U.S. campuses, an increase of more than twice the 
enrollment of ten years ago (HE Reports 1979:5), and an increase of 15$ in 
only one year (Time 1979.63). While not all of these students are enrolled in 
ESL classes, it is safe to assume that the majority have l>een enrolled { n at 
least one ESL class since their arrival in the United States, and many have 
been studying nothing but ESL for a year or longer. The number of foreign 
students studying in the U.S. in 1977-78 was almost equivalent to the number 
of students in the U.S. who studied university level French two years before 
* that. And if only one-third of the foreign student population was enrolled in 
ESL at any one time, there would still be as many as studied German in all 
the colleges and universities in the U.S. two years ago (based on enrollment 
data reported in Brod 1976:169). 

The second group, ESL students in Adult Basic Education, is potentially 
the largest group of all According to estimates based on the Survey of Income 
and Education conducted hy the National Center for Education Statistics of the 
Department of Health, Education and Welfare, approximately 2.4 million per- 
sons in the U.S speak no English at all, and of these 9($. or 2.16 million, are 
adults nineteen years or age and oldei (Waggoner 1978:252). Instruction to 
equip adult© with basic literary skills is provided as a part of the Adult Basic 
Education Act funding, English for adults who are non-English speaking is 
interpreted as a basic literacy skill. 

Finally, the third group is comprised ot those students in elementary and 
secondary schools from non-English speaking backgrounds who are not enrolled 
in bilingual bicultural programs. The survey referred to above identified more 
than ten million children between the ages of six and eighteen who are of 
nou- English language backgrounds (Waggoner 1978), although not necessarily 
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non-English speaking. Clearly many 01 these students will he enrolled in multi- 
lingual/multicultural programs, or some type of ESL program. The 1975 Task 
Force Findings Specifying Remedies Available for Eliminating Past Education 
Practices Ruled Unlawful under Lau v. Nichols specify that ,4 a district does 
Jiave an obligation to serve any students whose primary or home language is 
other thai, jiglish" (p. i)r Even discounting from this number of 10 million 
those children who are in bilingual bicultural programs, and those who have 
become English-speaking, the number studying in multilingual /multicultural 
programs or in ESL programs according to the specific J guidelines ought to 
be large. 

Further endorsing the idea that the ESL population in the U.S. is large, 
Kunkle (1977:213-214) identified 4,913,(XK) non-native Americans and residents 
who have serious difficulties with the English language. In his article he states 
that "a serious commentary is that of this group of almost 5 million people with 
difficulty :u English, approximately 4 million have not been enrolled in school 
at any time since September, 1974 (Kunkle 1977:214). If this is to be recog- 
iii/€»d as a problem, as undoubtedly it is, tb"ii we must ask ourselves, as pro- 
fessionals, what is the source of this problem? 

A major contributing factor not only to this problem but also to ones deal- 
ing with funding and recognition of ESL on campuses and in state supervisory 
offices is the program s uncertain status. Fanselow and Light (1977) present the 
historical perspective of why this has been so. They state that after World 
War II 

in spite of the tact that the teaching of English to non-nati\e speakers of English 
had been expanding great l> . it was difficult to know during those years what 

the needs of the field were and what the qualifications ol the teachers were or should 
he One of the reasons for this no doubt was the widespread attitude that speakers 
of a language can teach that language to non-native speakers by .rtue of the fact 
^that they speak the language! Another reason, perhaps ( i result of the first, was that 
stilt* education agencies usually had no offices to handle this area. Even on campuses 
where English was taught to foreign students, tile status of the programs and the 
teachers in them was often not clearly defined ( p, 3) 

We have not outgrown tins heritage. We still lack a national concensus as to 
which departments should house programs' for ESL students, which depart- 
ments should house the programs that prepare the teachers of these students, 
and which state and local offices should supervise the programs that give di- 

- In regard to the role of English ,is a Second Language m elementary bilingual programs, 
Nakano (1977.219-234) notes that "the single most inflammatory a d injudicious statement 
made in the (K-K Remedies is that 

Since an ESL program does not consider the affective nor cognitive development of the 
students in this category and time and maturation variables are different here than for 
students at secondary levels, an ESL program is not appropriate (p. 232)." 
This statement is unfortunate, since the 1976 TESOL Resolutions on Certification specifically 
call for a cultural component in TESL certification (Fanselow and Ligjht 1977:250-251). The 
task force findings, however, specify some type of "plan" { biltngurtl/blcultuH, multilingual/ 
multicultural, transitional bilingual education, or high intensive level , training — the latter at 
the secondary level only) 'when the district has 20 or more studenn of the same language 
Q identified as having a primary home language other than English" (p. 4). 
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rection to the teachers in them. In fact, the present situation can be described 
as little short of chaotic! 

Across the nation we find ourselves as a profession subsumed under such 
faculties as Foreign Language, Bilingual Education, and English, none of which 
can ever prove entirely satisfactory. Often ESL teacher preparation does not 
have a complete curriculum, but functions under the tutelage of already existing 
curricula in foreign language or in English where it becomes something of an 
academic stepchild, with similarities emphasized and differences ignored. At 
state and local public schools levels, ESL supervision and program develop- 
ment may be housed under English, or under foreign languages, or under lan- 
guage arts, or under bilingual education, or responsibilities divided among these 
various offices, or all of these combined into one office. In a survey of state 
foreign language supervisors we asked where ESL supervision at th*» "tate level 
was housed. Of the twenty-three responses not one responded that ESL was 
housed in its own separate department or office. The May 30. 1978 issue of the 
Chronicle of Higher Education addres ed this same problem 

The uncertain status of EFL has led to debates over who should he teaching it 
On .some campuses, the courses are found in foreign language departments, on 
others, in regular English department. Still others view the courses as college prep- 
aratory, (p. 7) 

The uncertainty about who should be teaching ESL is caused, to a great 
extent, because people outside the field do not have a clear definition of the 
scope, practices, and theoretical b ^es of teaching ESL. Because of the un- 
familiarity with what we do, as well as a lingering hint of mystery, campuses 
and supervisory boards are often very unclear as to what department or even 
to what discipline we belong. * 

As professionals, we need to define the borders of our profession in such a 
way that no uncertainty will remphi as to who should be teaching uon- native 
English speakers. Naturally there is a reason why so many ESL programs are 
housed in these different def utmeuts across our country. Commonalities do 
exist; however, because there are great cl ill erei ices separating the fields, we 
propose a rationale fur separating ESL programs into separate entities taught 
and staffed specifically by people with expertise in ESL. The impressive num- 
bers cited as well as the lack of certainty about who should be teaching ESL 
students and what our status is support the need for sound ESL teacher training 
curricula. Strengthening the rationale that sue^i curricula should be separate 
from those designed to train bilingual teachers and foreign language teachers 
is the fact that ESL students themselves arc <|nire different from other language 
students. Some of these differences are: 

1) Geographic setting. As basic as this -distinction seems, it is surely the 
most fundamental difference, between ESL students in the l".S. and other lan- 
guage 1 students Adult ESL students generally are at sea* in a foreign culture 
which at the very least demands oi them survival skills in English. In contrast}, 
English as a first language students have their support systems — family, friends. 
q - ilture, as well as the linguistic environment— all intact. American students in 
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foreign language piogiams also have these. General!), their language study ts 
not for immediate use hut i athoi sei\es the goals ot a humanistic education or 
some hiture expenencc (Exceptions nun be geouraphic areas where the study 
or Spanish, or Creole French, or Haitian French has unm" ,: ate arid utilitarian 
, purpose.) E\en students in hihugual programs ha\e both support .systems and 
linguistic-cultural s\ stems that the\ can tall back on. In addit'ou. students in 
bilingual piograms will ha\e teacheis and man> peers who are .>now!edgeable 
about and sympathetic to then uiltuies Because ESL students probably are 
enrolled in classes composed ot main nationalities, the\ may have no class 
member nor a teachei who can speak 01 understand their languages, and they 
may have no one who undeistands their cultuial background. 

ESL students also diHei from tspical foreign language students in their 
opportunities tor the acquisition ot language The acquisition of English by 
ESL students is an inevitable lesult ot the d.ul\ environment, whether they are 
at school, at work, watching television, shopping. 01 p uticipatmg m leisuie-time 
activities Thev thus live m what Krashen i 1977 ^ calls an acquisition rich en- 
\uoiii "iii. as contiast ' to the a< qiusit ton-poor en\ ironinent ot tvpical foreign 
language students \s krashen no'es however. 

the adult sicond lantfuai^ tanner 1ms tun means t'i mternah/intf rules of the 
target laiiiOWKf One is languor acquisition, which is \ V r\ similar to the creative 
construction pnn^ss utilized u. !>oth child LI and 1.2 acquisition The other 

process .unliable to main adults is lawjuaije learning which results in t nscious 
representation ot petlayuyital rules <p 145) 

Given these two pioci-sses. LSI students ha\e the advantage ovei U ;;n lan- 
guage students, because while "acquisition ma\ be slow . it is, i ,he long 
run. much more useful when language is used toi the purpose of ct amunica- 
tion" t p. 1j6) 

2) A«e We have discussed language learning versus language acquisition 
in terms ot the adult language learner. Whei. the number of foreign students is 
added to the number of persons in the I" * who are non-English speaking, the 
total is more than 2mVU) person* who are potential ESL students, less than 
ten percent ot whom are nineteen \ears ot age or under (based on the data in 
Waggoner 1 ( )78) This does not include, ot course, the mor- than ten million 
children between the ages ot six and eighteen who are in>»i non-English lan- 
guage backgrounds M;my will be enrolled in bilingual programs, and many do 
speak English. Tims, most ESL stuoeets m c mtrast to the majority of students 
in bilingual programs and most stude it- in foreign language pr« .rams, are 
largely adult learners. 

3 I Motivation, 'Because ESL students are in the Inited States, and because 
they are for the most part older students, the\ are general ly motivated to learn 
English as a means ot achieving immediate ends— whether these are to gain a 
college education, to complete a '„raduate degree, or. as new immigrants or 
husinessmen. to survive in :\e English-speaking world. Tins motivation is what 
Gardner and Lambert (19<2> call hist rumcntal motivation, rather than the inte- 
~ Q motivation typical of some of the best language learners in biliirmal 

ERIC 

247 ^-n 



hnifliiiffnrrriaaij 



244 The Learner in Focus 

education programs. In any S case, gSL students are in this Tcspect different from 
English as a first language students, who take ftfr granted the very automaticity 
that ESL students ar* trying to develop They are different as well from \mer 
ican foreign language studt <ts, who' typically conceive of foreign language as 
a course in the curriculum, ratlier than a means to achieving itecessuiy survival 

skills. ' > i 

4) Cultural Emphases, For the most part, the ESL class, unlike the bi- 
' lingual education class or the foreign language class, brings together not just 

two cultures, but a rich variety" of cultures In this' multicultural environment 
ESL students share their cultures with others while at the same tune learning 
to appreciate the universality and the^skared human values of all cultures. Y f 
thetcommon bond of language and culture shared imong all class members is 
the English language, and the American culture, both of which are embodied 
in the teacher. 

5) Levels ami Orientation of Lmigiwg*. ESL students will generally study 
English longer than foreign language students will study foreign bnguages. 
Therefore, programs for ESL students will include moie levels of study and 
more still development in all areas of English. There will usually be less atten- 
tion to literature in programs for ESL students than in foreign language pro- 
grams. Because of the mixture 6f nationalities in the classroom, the orientation 
of classes and of materials will usually be in English only, rather than English 
and the native language. Additionally, the progress of motivated ESL students 
can be more rapid than that of foreign language students hecau*' ESL students 
are learning and acquiring the language. 

• 6) Sources of funding ESL students at the elementary or .\econdary level 
can attend tax-supported tuition-free schools (unless they happen to be un- 
documented aliens). ABE-ESL students also have no tuition to pay. But aside 
Him a small percentage of exchange students, other adult ESL students and 
college ESL students may pay their own way or their countries or corporations 
must pay their tuition. While American college students have a number of 
scholarship sources, ranging,- from local to federal funding programs, and from 
private scholarships to public grants, ESL students must bring ample funding 
along with their visas, or, at the graduate level, have sufficient command of 
English t ) secure teaching assistantslnps oi other means of support. 

7) Research Because ESL learners in general aie m an environment in 
which language acquisition may he more unpen taut than language learning 
much of the advancement of second language acquisition research theories has 
resulted from studies of ESL learners, ho* children and adults ( Bailev. Madden 
and Krashen 1074, Dulay and Burt 1974, Larscn-Fieeman 1 ( )7>1. On the other 
ham 1 , while main studies of learner attitudes have enlisted' the paitieipation of 
both foreign language students and ol those m bilingual progiams (Lett l?)77, 
Cooke 1978. Gardner and Lan hert 1972), their has been little nttitiidma) re- 
search at beginning levels <>l ESL W hile the cultiuc shock that foieign students 
suffer has been observed and icported ( Bedingfield 1 ( )7S), we have little em- 
pirical knowledge to tell lis how loieign students -feel about learning English 

® - beginning levels m how tin \ fun) ^Ifqut \meiKan cultui* lu most piogiams. 
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instruments designed to tap these attitudes wouliHtfeti to be translated into 
scores of different languages^ and reliability of the translation into each Ian* 
guage established, a time-consuming* and expensive task. 

We have descnbed several of the ways* in which ESL students in mono- 
lingual programs are different from ESL students in bilingual programs, from 
foreign language students and .torn English as a first language students. These, 
along with a few other obvious differences, are outlined in table form in the 
appendix. (English as a -first language is omitted from this table because, for 
the most part, the differences between it and ESL are much greater and more 
obvious than those between ESL and foreign language or bilingual programs.)- 
In view of the differences discussed between ESL students and those in other 
language programs, what are tlv implications for the training of those who will 
teach ESL students? 

In the first place, unlike the typical foreign language teacher, the ESL 
teacher will not be the sole determiner of the foreign language content that the 
student will be exposed to because ESL students will be learning language in 
the elassr(K>m as well as acquiring it in the process of daily life in the U.S. 
Therefore, the ESL teacher will need to know in what ways language acquisition 
outside the classroom can best be utilized to further the learning process in the 
classroom. 

Second, because the majority of ESL students are older students, the ESL 
teacher will need to be aware of teaching methods suitable to that age group. 
Many foreign students and many adult ESL students will be highly motivated 
to learn English, hut at the same time may have false hopes about the length 
of time it will take to become fluent in English. The ESL teacher will therefore 
need great patience and understanding. Thus, the elitist attitudes once prevalent 
among many foreign language teachers clearly haVe no place in the ESL class- 
room. 

Third, because of the multicultural nature of most ESL classes, the ESL 
teacher will need to be knowledgeable about many cultures and evince respect 
for students from all of them. In the bilingual classroom, the teacher is expected 
to be bilingual and bicultural. In the ESL classroom, at least in the typical one 
where students are of many language backgrounds, the teacher will not often 
be multilingual, but she or he can become an appreciator of many cultures. 
Moreover, because the ESL teacher cannot speak all the languages represented 
in the class, she\r he has an </vtn greater need to have a thorough knowledge 
of the way all languages work —that is, a thorough grounding in descriptive 
linguistics, applied linguistics and sociolinguistics. And because English wili 
generally be the only language of the ESL classroom, tlx- ESL teacher will need 
a thorough knowledge of all aspects of the English language—that is, syntax, 
morphology, and semantics. 

Due to the factors just discussed, it is logical to assume that if ESL students 
have unique characteristics, and if ESL teaehers need unique training, then 
there should also be unique qualifications for future ESL teachers. Therefore, 
we should expect the curriculum designed to prepare future ESL teachers to 
d'^'rent from that designed to prepare future loieign language teachers or 
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future bilingual teachers. This is not, however, the case. Although the Guide- 
lines for the Certification and Preparation of Teaching English to Speakers of 
Other Ltinnuaf, in the United States were ratified by TESOL m 1975, there 
seems to be, as yet, little concensus as to what a cor.- ESL program should 
entail. The rapid growth in the number of students in ESL in recent years has 
created tremendous demands for KSL teachers. According to the preliminary 
data of the 1976-77 survey of the National Center for Education Statistics 
(Waggoner 1978:257). there were more than 100,000 teachers of ESL Yet only 
three in ten had taken even one course in ESL f Only 125 institutions .>f higher 
learning were tentatively identified as offering curricula that require courses 
predominantly in teaching ESL. 

As a result of these figures and because many foreign language teachers 
are assigned to teach ESL, we asked the state foreign language supervisors that 
we surveyed to identify areas of ESL teacher preparation that foreign language 
teachers would most need to study if the;' were to be employed as ESL teachers. 
Then we asked our audience of ESL educators and teachers who attended our 
session at the annual TESOL meeting in March, 1979, to respond to the sam^ 
question. The ten areas of preparation that received the most votes by each 
group are listed in their order of choice in Table L As is readily apparent, while 
similar courses appear in Ixrth columns, nevertheles' the focus by those specif- 
ically teaching ESL or preparing teachers of ESL is different from the focus 
of those tangential to the field This difference no doubt reflects the fact that 
priorities for ESL teacher preparation have not been agreed upon. 

TABLE 1 

Areas of KSL most important for a foreign languageVeacher to study 

Choices of state FL supervisors 
(N -23) 

*HSL methods 
Phonology and morphology 
Testing and evaluation 
American culture 
Syntax " 

Psychology of the Li learner 
Reading 

Theories of language development 
Composition 
Psycholinguisties 

Moreover, in too nuiiiV colleges and universities, only a course or two is 
offered to prepare future ESL teachers, A complete curriculum is lacking. It is 
obvious, however, that teacher training institutions will establish complete 
curricula to meet state requirements if states mandate certification or endorse- 
ment lor KSL teaer-rs, Blatchford (1977. 1979a) has extensively surscyed the 
college and university programs for training KSL teachers as well as t.tc present 
status of state reqmtemeiits for certification or endorsement in KSL. In his most 
recent survey, Blatchfoid ( 1979a, 1979b) found that only eleven states currently 
have certification oi endorsement for KSL teachers. In older to determine if 
O eie was a nationwide trend toward implementing some tvpe of KSL eertifica- 



Choiees of ESL educators/teachers 
(N -31) 

ESL methods 
Reading 

Psychology of the Li learner 
Testing and evaluation 
Syntax 

Tht-ones of language development 
Review of current literature and research 
Phonology and morphology 
Soejohnguishcs 
American culture 
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tton at the state level, we asked the. 41 state foreign language supervisors iden- 
tified by ACTFL (1978) if they foresaw the addition of ESL certification in the 
near future. It was our hypothesis that most states would not have certification 
requirements specifically for ESL nor would they plan to add such require- 
ments. Out of 23 returns, 16 responded that their states did not have now, nor 
would they have in the neai future, ESL certification requirements. Our hy- 
pothesis that there is no trend toward implementation of state certification re- 
quirements for ESL was confirmed at the .05 level by a chi-squared^ analysis. 
If one considers the universal requiicments for a complete curriculum of prepa- 
ration and for certification of English as a first language teachers and for for- 
eign language teachers, then it is obvious that ESL students are being discrim- 
inated against. The Sample Rationale for ESOL Certification developed by the 
Task Force of the New York State English for Speakers of Other Languages 
Bilingual Education Association states that: 

Non-native speakers of hntfhsh of ail axvs who haw clasM". m I.SOL are denied the 
right of having teachers eertif.ed in E&OL teach them KSOL The students in cheiu 
istr> ilas% have teachers certified in chenustrv, the students m French class haw 
teachers certified in French, hut the students of KSOL classes do not have teachers 
certified in F.SOL because there is no < ertification m KSOL ( Fanselow and 

Light 1977:257) 

If the inequity to ESI. students icsulhng fioin too -few state* requiring ESL 
cm rtiftcaticm can be remedied, then teacher tramii g institutions will be in a 
position to offer complete curricula in ESL. *fc 

< > .iven the uncertain status of ESL at piesent and llie growing number of 
ESL students, and given that there are many differences between these stu- 
dents and other language student*, can we then not argue for complete prepara- 
tion and certification of ESL teachers:* Must we not demand ESL preparation 
and certification as distinct from preparation and certification of foreign lan- 
guage teachers, bilingual education teachers; a. id teachers of English as a first 
language^ Indeed, we must and do. But not only should we demand certifica- 
tion of our teachers, but we must, as a professional organization, insist upon 
setting the curriculum for our future teachers. As a professional organization 
we should assume 'responsibility for instituting a standard body of knowledge 
that all teachers trained and certified tu ESL would share. Increasingly, teacher 
organizations, like other professional associations, are asking to control certifi- 
cation of those entering the professum. We as specialists, also need to accept 
this responsibility so that we may ensure quality ESL instruction for our stu- 
dents. We, after all. aie the ones most aware of the ways m which the E&L 
learner is different from the leainei in bilingual education programs, the foreign 
language learner, or the learner m English as a first language classes. We must, 
therefore, not only insist that ESL teacher certification oi endorsement be re- 
quired, but we must take an active hand in piesci firing the curriculum necessary 
for future ESL teacheis Wnl'out the acti\e involvement of the TESQL pro- 
fession m delineating its boundaites, quality ESL instruction as well as its status 
will akvavs remain unceitam 
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BILINGUAL 



ESL (MONOUNGUAL) 



FOREIGN LANGUAGE 



Funding due to bilingual acts and 
as a result of Lou V9. NichoU. Fund- 
ing generally excludes non-residents 
and aliens who compose the majority 
of ESL students in American uni- 
versities. 


Heavy funding in the '60s from 
NDEA grants as a result of post- 
Sputnik need to improve math, sci- 
ences, etc. as a direct result of the 
Cold War. 


Heavy funding from Bilingual Acts 
as a result of various court decisions. 
Bilingual students generally U.S. res- 
idents and therefore qualify for fund- 
ing. 


Ultimately pragmatic for purposes of 
business, academics apd professional, 
whether instrumental or integrative. 
Monetary opportunities often part of 
motivation. 


Aesthetic; liberal arts requirement; 
travel. ( Spanish exception to this in 
some areas.) 

* 


A «:milation; peer pressure to speak 
like other children of same age. 


Natives of Li study L, in L a environ* 
ment. 


Natives of L t study Lj in Li environ- 
ment. (Some regional exceptions.) 


Natives of L> study L and L, in L, 
environment. 


Wide range of ages, especially in 

, riciiitriiuw/ Hiiu univcrjiiy icvou . 


Mostly high school and university 

CIUUIIIIIvllli 

-i - 


Mostly elementary age children. 


Especially in university programs, 
more students in advanced Ibvels. 


In both high school and college most 
students in first and second year 
level, 


Basic concept formation; develop- 
ment of academic basics. 


All levels; at advanced college level, 
great concern with perfecting read- 
ing and writing skills. 


All levels, but majority of students 
at beginning levels. At advanced 
levels, great concern with literature. 


All levels; emphasis on developing 
reading and writing skills. 


Multicultural; emphasis on under* 
standing culture so as to effect better 
communications competence. 


Appreciation of L» culture. 

25m 


Maintenance of cultural identity of 
native language while learning to 
appreciate culture of L*. 
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background 
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professional 
background 
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LANGUAGE 

ACQUISITION 

THEORIES 



RESEARCH 



PEDAGOCICAL 
APPROACH 



TESTING 



WHERE HOUSED 



Majority of teachers L, native speak- 
er! teaching L,, 


Majority of teachers L, native speak- 
ers teaching La. 


Majority of teachers bilingual, teach- 
ing Li and La. 


Majors in: 

1) Foreign languages, 

2) Elementary/ secondary 
education with emphasis * 
on English 

3) Few majors in ESL 


'Most college majors in foreign lan- 
guages. 


Majors in: 

1) Elementary education with 
emphasis English 

2) Elementary education with 
emphasis bilingual 

3) Foreign language 


I* language acquisition . similar to 
L, acquisition. 


L 3 language acquisition not similar 
to L acquisition ( cf . especially 
Brooks (1966). 


La acquisition similar to Li acquisi- 
tion; emphasis on natural sequences 
in child language development 


Emphasis on study of language ac- 
quisition in both children and adults; 
little use of L, survey research to 
investigate attitudes toward L,. 


Emphasis on study of language 
learning, largely in high school and 
college learners; use of surveys in 
Li to investigate attitudes toward La. 


Emphasis on child language acquisi- 
tion; studies of effects of bilingual 
education on basic achievement. 


Integrative approach; stresses sur- 
vival in American English-speaking 
communities and in American school 
and college system. 


Assumes a homogeneity of student 
populace; not geared toward teach- 
ing academic survival needs. 


Integrative approach; maintenance of 
L» while developing fluency in La. 


Mostly, integrative approach; not 
concerned with piecemeal knowl- 
edge. 


Mostly discrete point rather than in- 
tegrative in large* majority of first 
and second year classes. 


Language dominance testing; basic 
skills in Lt and La. 


Subsumed under other departments 
such as foreign language, English. 


Separate department or facility. 


Separate department or facility or 
housed with language arts, foreign 
language, English or K-12 education. 
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With the recent influx of non-English speakers to many U.S. urban areas— 
primarily fndochinese refugees and Spanish-speaking immigrants — many non- 
ESL teachers find that they need TESL .skills to deal with their pupils. As a 
result, many ESL teacher-training programs attract non-traditional TESL stu- 
dents such as bilingual education teachers, paraprofessionafs, adult basic educa- 
tion volunteers, and public .school teachers with non-English-speaking pupils 
in their classes. The challenge to teacher trainers is to prepare a compact, one- 
term course to meet the needs of such a diverse population. 

Part . One of the paper reviews the recent literature on teacher training in 
ESL and provides a summary of the mam issues as they arc revealed in this 
literature. A bibliography is included. 

Part Two describes a one-semester "uou-traditioual" TESi^ methodology 
course developed at the University of Colorado at Denver to meet the needs 
of a diverse non-traditional TESL student population. It is argued that teacher 
trainers need to focus on specific techniques, and to address theoretical issues 
only when they arise in discussions of those techniques. In presenting a tech- 
nique, trainers should make explicit 1) the ESL proficiency level, 2) the skills 
and 3) the type of pupil for whom the technique is appropriate. A three- 
component model is presented for implementing this approach. A course syl- 
labus is outlined and sample classroom procedures are described. 

1. Introduction 

If our experience at the University of Colorado at Denver is indicative of 
a trend throughout the United States, teacher training programs will increas- 
ingly have to offer courses to meet the needs of non-degree students seeking 
skills in teaching ESL. Many of these students arc teachers untrained in TEFL 
TESL niethodology and linguistics who have recently discovered their classes 
filled with non-English speaking pupils. Faced with this situation these teachers 
have two alternatives: (1) ignore the needs of the pupils or (2) .seek assistance 
in acquiring the knowledge and skills to work with non-English speakers. Much 
to their credit, these teachers in ever-increasing numbers are choosing the latter 
alternative. 

The challenge to teacher trainers is to prepare a compact, one- term course 
to meet the needs of these non-traditional TESL students. Part One of this paper 
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reviews the recent literature on teacher training. Part Two presents a model for 
a one-term TESL course in wlijch theoretical issues and practical classroom 
techniques are integrated. / 

2* Review of literature 1 

There is considerable agreement in the literature on the question of what 
constitutes the ideal ESL teacher. Early articles by Brooks ( 1966), Cloze ( 1966) 
and Marqwardt (1969) emphasize the interdisciplinary nature of the training 
of second language teachers, a view which is echoed later by Strevens (1974) 
and Alatis (1974). Alatis' article is representative of these points of view. He 
offers the acronym LAPSE as a framework lor teacher preparation: Linguistics, 
Anthropology. Psychology, Sociolingui sties, English Education. Guidelines for 
the preparation of language teachers have been developed by the Modern Lan- 
guage Association (1966), the National Council ot Teachers of English for 
English Teachers (1968) and TESOL (1975). The guidelines vary in a number 
of details, but all attempt to list the personal and professional qualities of the 
ideal teacher. In general, they advocate entrance criteria which assure native or 
near-native proficiency in the target language and personal qualities which 
would allow the individual to understand and take advantage of cultural differ- 
ences in the classroom. In addition, they present outlines of course offerings 
which cover a wide range of academic areas such as those contained in Alatis' 
LAPSE approach. They also recommend some sort, of practical experience in 
language teaching as part ot the trainees program 

While educators seem to agree on the qualities of the ideal teacher, there 
is less agreement on the most effective procedure for training teachers. Articles 
which focus on the process of Vacher training can be divided roughly into two 
categories. 1 ) those which . .• Methods oriented and 2) those whicn are Tech- 
nique oriented- 

The Methods-level aiticki grapple with the issues of program develop- 
ment and general orientation of training programs: Strevens (1974) highlights 
the characteristics of a good teacher and sketches the outline of a training pro- 
gram. Alatis (1974) goes one step further and reports on 'the evaluation of an 

.niffhi T^/r ,eV ft lS Uy meanS nha,ls,lve ' wt " a«emPt to highlight recent work which 
Ki t offer spec.nc r progrmiima ic suggests or the teacher trainer. An excellent review of 

L,i I n-?7 prHd J"V n 1mk1 V ,,0nm * h ,M '*'JK«Phy <>f> trachea tuinin,! f„ ESOL^ hi- 
fi ' I"' 1 *" ] * mia * v baching is contained in Fanselow and Light (1977). Fanselow 
;?u , ni V nU K r iln ,,nP °:! anl P<l, , nt "in- mtnnliirtian to the book. "In reviewing th 

^.W^n in th, journals related to language teaching, not only is there 
a dearth of ..rhcles of a research nature, but in fact, there are very few selections that treat 
eaeher preparation at all.' (1977. 9). It might I*' added that the ma,cmtv 7the art 

'A U T' UvT P r( 'P aration A ™ of a theoretical or issues nature; there are very few 
which provide specific suggestions for teacher trainers. 

' ll»Vl M |« h a"' ' T £ ,hn,qU ? ',. art> Use(1 ,oos ? ,y h . er< ' in th <* * c ™* ^eloped by Anthony 
»V h uZ** hl \y nSOt ihe * ni <}' tU ™ mentioned earlier are o n the Approach level because 

L&'t 0 *' st,|f L 0m * qi r ti0ne< trllths I ,r 1,xio ™ of preparation^ The MaVhoTKS 

S «rneral procedures and intent while the Technique level articles f^ufon 

spec fic strategies *nd activities. It is assumed, of course, that in a particular program, there 
2^, , ^^ p,e ^ l w ?f ,,,l > f l m,on K ^ ^vels, that is, the P techniques used would 
S^the ? JSt^^ WHiCh * M «" * well-developed^aV 
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M.A. program at Georgetown University. Lee ( 1974) and Taylor ( 1978) argue 
for a more practical focus in teacher-training classes while Jarvis ( 1972 ),- Combs 
(1972), Blatchford (1976) and Dowling and Shepard (1976) emphasize the 
need to focus on the subjective, humanistic side of teacher training so that 
teachers can develop the ability to make accurate assessment of their students 
problems. A substantial body of literature on the Method level is devoted to the 
pros and cons Of competency and performance-based teacher education (CBTE 
and PBTE). Growing directly or indirectly out of researchers" attempts to 
identify the classroom behaviors of good teachers (see, for example. Fanselow 
1977, Lee 1972, Moskowitz 1976) the movement tor CBTF PBTE is founded 
on the belief that basic competencies can be identified, isolated, and then im- 
parted to prospective teachers in a teacher training program, Supporters ot 
one form or another of CBTE /PBTE (for example, Fanselow 1977, Lee 1972, 
Schmieder, et al. 1977) point to the need for objective criteria for judging the 
acceptability of the training of future teachers. Critics (for example. Blatchford 
1977, Resolutions 1977) point out that there is a great deal of work to be done 
before we can confidently isolate specific behasio.s which characterize good 
teachers. Indeed, in a review of the research literature Cyphcrt ( 1972) comes 
to the conclusion that there is little available empirical data which illuminate 
the relationship between teache. behaviors and student progicss Although 
CBTE PBTE-related articles usually otter quite derailed ieeommeiidations. the> 
are classified as Method-level because the> rarch provide specific suggestions 
for teacher training techniques. 

Articles on the Technique level cover a wide lange of issues and techniques 
in teacher training Orwen (1971) otters a blueprint for the Methods course, 
providing a framework for team teaching by trainers and peer teaching b) 
trainees. McArdle's (1971) article cuts across the Method Technique classifi- 
cation we have established hut is included here because he otters a detailed 
framework for evaluating the knowledge and skills ot trainees. A number of 
articles suggest ' refinements in microteachiug. a technique in which trainees 
teach a short (five to fifteen minutes) lesson to a group of ESL students, usually 
focusing on a specific language problem. Politzer (1966) is widely recognized 
for his use of micro-teaching techniques. Modifications in the use of micro- 
teaching have been explored by Carver and Wallace (1975). who combine 
micro and peer teaching; by DcLorcnzo ( 1975). who found that experience wiHi 
microteaehing reduced anxiety in student teachers. -and by Dugan (1967). who 
developed an intensive summer institute around nncioteaching Forrester ( 1974) 
and Moore (1977) provide tips tor using the demonstration lesson in which 
trainees observe and critique a lesson as it is being taught 

Implicit in most articles on teachei tiaining is the assumption that the 
trainees are either full-time students or that they aie a iclatixely homogeneous 
group in terms of teaching experience, prior training, caieei plans, etc Unfor- 
tunately, such assumptions cannot he made about the students attracted to the 
TESI. methodology course at the University of Colorado at Denver. Part II 
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provides a description and explanation of a one-term course offered at UCD 
for a diverse population of non-traditional TESL students. 

3. The one-terra TESL course 

Need. It is, of course, every teacher's fantasy to have relatively small, 
homogeneous classes. Events of the past few years appear to have made Mich 
fantasies even less likely than before, With the recent influx of non-English 
speakers to urban areas, many noii-ESL teachers find that they need ESL skills 
to work effectively with their pupils. As a result, ESL teacher-training programs 
are increasingly attracting non-degree students seeking a one-term, all-purpose 
course which they hope will prepare them to work with the non-English speak- 
ers in their classes. This group on "iion-traditional" TESL students includes bi- 
lingual education teachers, paraprofessionals, adult basic education volunteers, 
public school teachers with non -English speaking pupils and an occasional 
walk-in, that is, an inteiested undergraduate or special student These non- 
trad, tional TESL students bring a wide variety of knowledge, skills and ex- 
perience to the TESL classroom, some are experienced teachers in another field, 
others have never taught, some have a thorough understanding of linguistics and 
English structuie, others have none, some have worked extensively with people 
from other cultures, others have no cross-cultural experience, some speak several 
languages other than English, others are monolingual English speakers, some 
are interested working with children in "traditional" classroom settings, others 
are interested in the "non^raditional" education of adolescents or adults. The 
list could be extended indefinitely It is a curious and confusing array of people, 
but for teacher training prngiams in large urban areas, this is the reality of 
higher education. 

The dilemma v\hich teacher trainers face in this situation is, of course, 
classic. How does one give the huinees techniques they can use and the theo/ 
retical information required to make the most effective use of those techniques? 
The methodological framework described below was developed in response to 
precisely this question. Although developed for use in a college or university 
setting, we believe it is a format which, with a few adjustments, would prove 
ideal for the 1 user vice training needs of practicing teachers. The priorities for 
the course, in order, are- 

1 ) to provide students with a bag of tricks, a number of classroom tech- 
niques which they can use immediately. 

2) to provide students with the ability to adapt material from a variety of 
sources and to write their own materials. 

J) to provide students with the opportunity to observe and analyze master 
teachers of ESL in a variety of classroom situations. 

4) to provide students with the opportunity to practice a variety of tech- 
niques. 

5) to provide students with an understanding of the lesources available to 
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them (i.e.,* professional organizations*, professional literature, local re* 
source centers and libraries, materials available, etc.)- 
6) to provide students with sufficient theory to understand the implications 
of the choices they make. 

The Confounding Cube, Given these general priorities, the trainer must de- 
velop a syllabus . which systematically addresses the needs of all the trainees. 
These needs, in terms of the context in which trainees might be teaching, can 
be represented graphically by the Confounding Cube of Fig. 1. 

Figure 1 
The Confounding Cube 




Level 
a. Beginning 
I). Intermediate 

c Advanced 



2 Skills 



3 Students 



a Listening 

b. Speaking 

e. Beading 

d. Writing 



Public School 

1 ) Elementary 

2) Secondary 
Intensive Course 
Adult Basic 

Education 



In teaching one-term methods courses to heterogeneous groups of trainees, teacher 
trainers must be aware of the diverse challenges which the trainees will face in the 
field. The various combinations of levels, skills and students — represented graphically 
by the Confounding Cube — must be systematically addressed in the methods course. 

Because the trainer cannot assume that everyone in the class will be teach- 
ing in the public schools, or in an intensive English center, or that they will be 
focusing on oral aural skills oi that they will be working with students /of the 
same ESL proficiency, s he must attempt to systematically touch on as many 
combinations of the cube as possible. Used conscientiously as a framework for 
preparing lessons, the cube enables the trainer to avoid the oft-cited (for ex- 
ample, Orwcn 1971, Paulston 1973) weaknesses of Methods courses, that they 
represent the experience and expertise of the trainer more than a systematic 
review of possible methods and techniques. Using the cube in this fashion, the 
trainer makes explicit the applicability of a particular technique. Figure 2 shows 
one possibility. 

Trainees lot whom such a lesson is appropiiate would benefit 'immediately 
from the presentation while others vould be aware of the adjustments they 
would need to make to ell actively use the technique in their classrooms. Such 
a presentation is e.pecialK valuable for the neophyte who begins to acquire an 
appreeiati ;n lor the complexity of the field and an understanding of which tech- 
niques aie appropiiate for which student population. The cube also permits the 
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Figure 2 
Focus of a Sample Lesson 




An example of how the Confounding Cube is used to focus a presentation; this lesson 
would present a technique for teaching writing to intermediate- level ABE students. 
Trainees interested in, or working with, different students of*t different levels would 
be expected to make the necessary adaptions to the technique. 

trainer to check him /herself in planning the course to meet the needs of all 
trainees in the class. 

Three component systent Many authors have called for less theory and 
more technique in TESL methods courses (see, for example, ' jwen 1967, Alatis 
1974, Lee 1974, Strevens 1974, Schmieder, et al. 1977, Taylor 1978). Arguments 
for such an emphas. are compelling; i.e.. that in short-term training, it is im- 
portant to give trainees something they can use immediately; that it is the re- 
sponsibility of a trainer to enter a course with an open mind, prepared to give 
trainees what they need rather than what s he happens to enjoy teaching, etc. 
As Lee (1974:37) states, "getting down to the grass roots in the training of . . . 
teachers means getting down to the detail of what they have to teach, but not 
necessarily to the deti- of any i'>derlying theory Bowen (1987:32) supports 
this perspective: "Short term training should . . . limit itself to surface struc- 
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turc, to teaching the application of the host and most pioduttive of current 
teaching techniques presented as effectively as possible " 

Of course, none of these authors advocate the elimination of theory from 
methods courses, but their experience has led them to the conclusion that a 
course which emphasizes the practical over the theoretical will be the most 
effective. Unfortunately, most write rs oiler veiy little specific information on 
how a trainer is to integrate theory and practice. Most would undoubtedly agree 
with I At* (1974 37) "General principles, if it is possible to puisne and find , 
them, an* better glimpsed and then, perhaps, tracked down through the detail 
(of teaching techniques) met v'tli lathei than lcccitcd as announcements/' 
However, the three-component fiamework presented here is, to our knowledge, 
the first detailed attempt to show how theory and tcchnique.nught be effectively 
integrated in a .short-term methods course. 

The three components are Technic pie. Practice, and Theoiy.' They are 
presented to the trainees at the beginning of the session as the, framework 
within which everything m the course wili < v akc place. Itjs made clear that: 
1) the highest priority for the course is fe* trainees to atmire a bag of tricks, 
a portfolio of techniques for use in thvir classes, 2) nicst of the techniques 
presented ^wnT have to be modified somewhat lor then use, and 3) an under- 
standing of the theoretical basis and implications of a technique is essential 
if they, aie ESL teachers, are to be efficient, effective decision makers rather 
than mere performers' 1 ( a la Jarvis 1972) 

With the above information as background, the trainees are then piesented 
with a series of techniques which they practice and then examine iroin a theo- 
retical point of view From the trainei s point of view, there are a number of 
options available under each component. Foi oui purposes a technique is a 
specific classroom activity for teaching or testing a language' skill, or grjynmar 
vocabulary point. The essence of the Technique component is its brevity and 
the cxpll ltnoss with wl h the behaviors of the teacher and the students are 
explained and demonstrated. Pi act ice in tins context is any activity which pro- 
vides trainees contact with the real world, it might take the form of observation 
of a master teacher, imcrotcaehiiig, group discussion, individual adaptation, etc. 
The Theory component provides an opportunity -to step back from the tech- 
nique, to establish the context in which the 1 technique mnjlit be used, to explore 
the implications of the technique for the students, the syllabus, and the teacher. 
This might take the foim of class oi small group 'discussion, a mini research 
paper, or a reading assignment, Chait 1 piovides a jrieture of the options avail- 
able to the trainer. 

In our experience, the 1 most effective use of time provides loi appioximately 
50? for technique. 2">V for practice, and 25V for theory These pcicenhiges would 
applv both to individual lessons and to the couise ns a whole. Thus, the comi'sc 
is developed around \ uuinbei ol technique picsentations with time stiuctuied 
so that tiaiiieos have the oppoitninty to work with the techniques and to reflect 

1 Theorv is used here to mean a rationale which modes classroom ptucttee, theory en' 
compasses approach and method as defined hy Anthony (190.1). 
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CHART 1 

A 3-Component Mode! for Toucher Trtflmntf 



Technique 

Presentation by 

trainer 
Demonstration lesson 

Micro-lesson 
Shock language 
lesson 

Film, filmstrip 

Slide presentation 
Video tape 



Practice 
Observation of class 

Small-group 

discussion 
Individual adulation 
Lesson-plan 

development, 

adaptation 

Iu-turn demonstration 

Case-study problem 
solving 



Theory 

» or small-group 
k isc ussion 
Out ule rc.idintf 

Mini-research project 
i i.mier-tacilitatcd 
discussion 



/ 



upon the theoictical implications ol 'die techniques. ( It is important to keep in 
mind that each technique is presented with specific lelerenee to a particular 
skill, language eompett nee level, and student population. Thus, it is necessary 
for the inajonjtv of tiainees to adapt the technique hetoic it will he of use to 
them.) 

At this point the astute icadei will Kiise objections. ' There is nothing new 
heu\ What makes this different from the average methods eouise J How does 
this couise avoid becoming mctck a series of ESL workshops or msemces? Li 
terms of content, there is nothing new !»er<\ and an observer might not be able 
to distinguish this coinse from anv number ol TESL methodology courses taught 
around the country. And the major pitfall of this approach is, o, r course, that 
the series of techniques will become merely that, a series of techniques, rather 
than pieces of a mosaic which acquire meaning as part of a whole. 

The unupieness of this proguun can be evaluated fron, the perspective of 
the trainer and the trainees The trainer is given a framework vvhuh permits 
hhn her to systematically relate theoiy to practice. Although the focus of the 
course is a series of techniques, this is only the surface structure of the class, 
fact, in planning lessons, the trainer first establishes the tneoretical issue*: 
Jx addressed ancb then selects the tcchnique(s) which will illuminate tl 
issues. The effect is that the techniques are foregrounded, hut there is alv 
the partially hidden agenda of the theory winch they exempli**'. The tra 
thereby avoids the temptation to dwell on theory to the detriment of practice 
01 to overemphasize pet techniques 

From the trainees' point of view, the course provides a selection of specific 
classro >in techniques which are immediately available for use. In addition, the 
tiainees acquire the essential skill {which is not always specifically taught) of 
adapting techniques aiixl n ,,4 >ials for then own use. And, finally, the trainees 
acquire, bit by bit, a, theoret.cal perspective on what they aie doing in the class- 
room. The constant insistence of the tieniei to establish a link between tech- 
niques and theory results fhopcfnlk ) in the trainees' appieeiation of the insep- 
arabilitv of the two. 
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4, Course outline and lesson plan 

' / 

Appendices I and 11 contain a sample course outline' :fnd a lesson plan de- 
veloped according to the above liamework. The s\ Halms* has been adapted 
slightly from the course outline presently bring used in a method* class at the 
University of Colorado at Denver, it is designed to cover a fifteen-week semester 
of classes which meet once a week lor a three-hour session. The syllabus is pre- 
sented as one example ol how the above model might \w implemented, ob- 
viously, the needs and experiences of other, trainers will dictate modifications 
before the syllabus could I e used elsewhere. 

It is important to stress that preparation of the course ( and individual 
lessons) proceeds horn theory to technique. That is, the tramer first develops' a 
list of theoiclu-al issues that s he wishes to covei in trie course of the semester 
and then selects specific techniques which exemplify those issues. However, a 
major part of class time and virtually all of the asNignments focus on classroom 
technique. 

It will be noticed that Rivets and Teniperley (1978) is shown as the text 
for the course. Theie aie, ol course, a number ol good texts which might be 
Used for example, Einocehiaro ( *974) and Paulston and Binder J 1976). In 
addition to the text, we have suggested a number of articles which focus on 
specific teaching and testing techniques. VV<* have kept the readiiig to a min- 
iriium because a major assumption of the course is that trainees will be spend- 
ing a considerable amoirit of tunc searching out techniques which "are relevant 
to their needs. We felt, therefore, that an extensive required reading list would 
preempt their flexibility m finding lesources of specific^ value to them. 

One further comment about the course outline, it should be noted that' 
special attention has been given to the three ureas of skills, level, student char- 
actenstics of the "confound nig cube" in developing the sequence of issues and 
the presentation of techniques The sequence in which ESL skills are addressed 
is probabK fairly standard for most methods courses. With reg'-ml to techniques, 
we have attempted to pio\ide a balanced presentation of techniques which 
would be appropriate foi beginning; intermediate, and advanced ESL students 
in programs at public schools, intensive English centers, and ABE programs. 
The proportion of time that a trainer devotes to any of these areas would de- 
pend, ol course, on the paiticulai needs ol the trainees -in the class* 

The lesson plan is presented as a specific example of how the framework 
developed m this paper nught translate into the classroom. The amount of work 
is ambitious but tfv uniqueness of the approach is in the presentation of as 
many techniques is possible in the time allowed. If it becomes neccssprv to 
reduce the amount of matenal covert^ in class, vve generally choose, to sacrifice 
the tbeoiv section Discussion of particular techniques often overlaps " Uo sub-* 4 
sequent sessions and, il a paiticulai ly important issue requires nu . time, a 
theoretical question can be assigned foi trainees to ponder for the n' \t session. 

In our experience, the emphasis on technique latlicr than theoiy lias made 
it possible for all students to benefit horn' class sessions Tiainees with ESL or 



A Training Model 



259 



L2 teaching experience tfcneially i-cniti ilnitt- unexpected insights to class dis- 
cussions, and they often ask questions which force the discussion to a higher 
theoretical plane Neophytes to RSI , on the other hand, hcqut;ntly ask ex- 
rremelv naive (juestions, whose value lies in folding the tiamer and more ex- 
plicated trainees to cart-hilly evaluate veiy fundamental assumptions ahout 
1,2 teaching. This fruitful interchange t>f ideas flows directly, we believe, from" 
fhe mimric evanunatiun (if techniques, if the lotus were allowed to shift from 
technique to thctuv picmatuielv , the less cvpeiicnccd students would not be 
•able to contribute to 01 benefit hoiu the discussion , 

It should be noted that a significant portion of each clasv^session is devoted 
to adaptation and demonstration" (if 4ecluuques by the students It might be 
thai out- of the techniques is initially denionstiated In a tiamce, 01 that a trainee 
is icqmied to deinonstiatc a U chinque mtrotluccd bv the trainer Also, we have 
had success with small-nrtMip demonstrations, trainees seem less hesitant to 
adopt student teachei loles m front t)f fevvei people 

^t^ardless of hou a technique is presented ami piacticcd, trainees are 
consistently requued to uenciate tec hneiues (Stevick l l )72a ) foi improving the 
rhvtjun of class activity Technemcs aie small variations m a technique which 
ittsult unmpiovujg the appiopi lateness of tht technique foi the learners (eithei 
bv challcn^m^ the fastei pupils 01 bv heSmtf the slowei pupil )/ Students aie 
aUo ictpmetl to evaluate and adapt techniques and matt-iials' " according to 
Stevicks (l l )72b> cntena of stiemjth, tlepth and lightness. The intended eflect 
is that, although onk five 4 or si\ techniques may be presented in one lesson, 
each strident will have pioduted meaningful /vanations of each technique for 
his her own use 

In conclusion, wjpiaimot claim that f !ic hamevvoik pieseutetl here piovides 
the ultimate scjution loi teacher trainers who aie faced with a bewildering array 
of tiamee expectations foi the onc-tenn methods eouisc The demands of EFL 
KSL instruction are !ai ((k> complex foi ultimate solutions:' Hovvevei, we can 
it-port that the approach outlined m this papei has increased the general level 
of satisfaction with the one-teim couise The focus on practical techniques, bal- 
anced by the tiameis consistent efloits to i elate these techniques to theoietical 
issues, has produced <n expciiente m which tiamees atqiuic skills which they 
can use on Monday morning coupled vuth an appicciarion for the theoretical 
foundation on which the techniques lest 
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Hit- trainee makes other arr.uigeuu-iits, the lest (if the class will sei\c as 1 SI. students 
lor the demonstration. 

3 Sis hours of observation of master KSL teachers Hi ports on the obsei vat ions are 
to lie hiifl (2*3 pages) ami arc to focus on the spec if ic ttchmquts used 1>\ the 
teacher A siimniary statement should indicate how the techniques < oulfl he adapted 
tor other teaching situations 

t Kmal is •mmation on readings, disi nssions I lie evain will (o\er both knowledge 
and skilK acquired m the course 



Course Syllabus: 



Session 

1 

2 
3 



8 

9 

10 
11 



12 
13 
14 



15 



Issue 

Structural Pattern 
Prills 



leaching Listening 
Skills 

( 2 niiiu-rei)orts 1 lue ) 



leaching Speaking 
Skills 

{ Observ at ion repurts- 
3 hrs. due) 

Teaching Heading 
Skills 

{ 2 mini-reports due ) 



Teacl ig Writing 
Skills 



testing KSK 



l*mai hvani 




Techniques 

-Shock lesson (Arabic) dem- 
onstrating repetition, substi- 
hon, transformation drills 
— Dialogue adaptations 
—Mechanical, Meaningful, 
Communicative drills. 

— Dictation 

— Lecture note taking 
--Total Physical Response 

— Sound discrimination exer- 
cises (from Morley, 1972) 
— Use of pnppets 

— Minimal pairs 

— I'm- of props (inurors. 

( harts, etc ) 
Stress .i d intonation 

m drills 
—language experience 
— Teaching vocabulary 
—I sing unedited texts 

— reaching skimming & 
scanning 

— Handwriting 
—Controlled compositions 
— "Free" composition 

techniques 
— Evaluation techniques 



— Testing ord aural skills 

(dictation, interview, oral 

clo/e tests) 

Clo/e test construction 

—Constructing distiactors for 

multiple choice tests 
—Correlating test Jesuits with 

standardized tests 

— Hequfres students to eval- 
uate and adapt specific tech- 
niques matciials, tests 



Heading 

McCready 1972 
Paulston 1972 ' 
Hiver 1978, 

Chpts 1, <t 
Stewtk 1972, 

a, b 

Morley 1972 
Riven 1978, 

Chpt 3 
Sutherland 1967 



Risers 1978. 

Ciipt 5 
Rolnnett 1972 
Strain 1972 

Clarke and 

Silberstem 1977 
Rivers 1978, 

Chpts 6, 7 

Hncre 1966 
Kaplan 1968 
Knapp 1972 
Lawrence 1972, 1977 
Rivers 1978, 
Chpts C, 9 
Raimes 1976 
Seward 1972, J979 
Seager* 1972 

ila.ris 1969 
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Course Readings * 

Allen, H. and Campbell, H. 1972 Teaching English as a Second I anguage (2nd Kdition). 
New York /McGrau -Hill. 

Briere, E, 1966. "Quantity before Quality in Second Language Compositions" Language 
Learning 16.3*4, 141-151, 

Clarke, M. A. and Silberstem, S. 1977. Toward a Realization of Psycholinguist ic Principles 
in the ESL Reading Claw Language Learning 27 135-154. 

Croft, K 1972. Readings on English as a Set and Language. Cambridge. Mass. Wmthrop Pub 

Harris, D. P. 1969. Testing English as a Second Umguage New York: McCraw-IIiil. 

Kaplan, C. 1968. Guided Composition. New York: Halt, Rmehart and Winston. 

Knapp, D. 1972. "A Focused, Efficient Method to Relate Composition Correction to Teach- 
ing Aims.'' In Allen and Campbell, 213-221. 

LawTence, M. 1972. Writing as a Thinking Process. Ann Arbor: University of Michigan Press. 

Lawrence, M. 1977. Heading, Thinsmg vml Writing Ann Arlx>r University of Michigan 
Press. 

McCready, G 1972. "Developing^ Lesson Around a Dialogue" In Croft 1972, 106-114. 
Morley, J. 1972. Improving Aural Comprehension Ann Arbor. Univcrsty of Michigan Prrss. 
Paulstoti, C. B. 1972. The Sequencing of Structural Pattern Drills In CroJt, K. 1972, 148-162. 
Ramies, A. 1976. "Composition. Controlled by the Teacher, Free for the Student" in 

j, Fanselow and Crymes, On TESOL 76, Washington, D C: TESOL, 18-3-194. 
Rivers, W. and Temperley, M. S. 1978 A Praittcal Guide to the Teaching of English as a 

Second or Foreign Language New Wk. Oxford University Press. 
Robinett, B. W 1972. "Simple* Classroom Techniques for Teaching Pronunciation." In Croft 
1972, 80-86. 

Seward, B H. 1972. Teaching Cursiw Writing to KFL Students*" English iMngnage Teach- 
ing 26.2, 169-178. 

Seward, B. II. 1969. Handwriting m English A Practice Hook. Denver: Th*» Center for 

Research and K ducat ion. 
Seager, W. 1972. Classroom Techmciues for 'Controlling Composition In Croft 1972, 232-244. 
Stevick, E. W. 1972a. Technemes and the Rhythm of Class Activity.** In Allen and Campbell 

1972, 88-94. 

Stevick, K, W. 1972b. "Evaluating, and Adapting Language Materials" In Allen and Camp- 
bell 1972, 101-120. 

Strain, J. E, 1972. "Teaching a Pronunciation Problem." In Croft I9Z2, 71-79. 
Sutherland, K. K 1967 "Dictation in the language Classroom.*" English Teachtnti Forum 
5.2, 10-14. 

APPENDIX II 

SAMPLE LESSON PLAN 

Technique #1. Handwriting Exercnes 

1. Shock lesson: Arabic script. Trainees are presented with the Arabic alphabet, 
allowed to struggle for a short time with the symbols (Arabic is vsrittcn from right 
to left, vowels are not normally represented in cursive, and the form of many letters 
crumbs dramatically when attached to other letters). Trainees are then introduced 
to the concept of orgam/mg the letters according to distinctive features, and teach- 
ing the letters in groups 

2 English presentation Trainees practice with English graphemes organized on the 
same model (from Seward 1972, 1979) 

Example, The e group t 
. e, i, u, t Us 1.0 svo j£T 

3 Discussion, clarification. 

Technique ^2. Controlled Composition Sentence Combining 

I. Trainees are presented twth everuses in sentence combining, told to v\oik the 
exercises. 
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Example. 1 The salesman ate ten sandwiches The salesman felt sick 
2. I took my umbrella to work It was raining yesterday * 
(From Lawrence 1977, 77) 

2. Tiamecs produce similar exercises from, various texts* intended for a yanety (in terms 
of both level and age) of KSL learners. ■* . 

3. Two trainees write excerpt? from their adapted exercises on the ward and explain 
(using Stevick 1972b) the importance of their adaptations. 

4. Discussion, clarification. 

Technique #3: Controlled Composition Scrambled Paragraphs 

1. Trainees work with scrambled paragraph exercise, attempt to discover optimum 
sentence order. 

Example The seven sentences Mow constitute a paragraph, but they have been 
printed in a disordered sequence. Reconstruct the paragraph. In the space 
at the left put the number of the sentences in the order in which they 
should occuc. 

1. Furthermore, tacos, enchiladas, chili, and frijoles are common fare foi 
western tables * 

2. Although the Spanish political and military influence has long since 
disappeared, some influence remains in the continuing use of many 
words of Spanish origin. 

3. The western and southwestern parts of America were once under 
Spanish control 

4. Most western Americans know -what is meant by loco, amor, and 
pronto 

^ 5 This infiltration of Spanish into English haj added color and a certain 

special flavor to our speech, 
(From Kaplan 1968, 131) 

2. Discussion, clarification. 

Technique #4: Free Composition: Quantity before Quality 

I. Trainees are introduced to the concept of free association writing, told to write as 

much as possible in 3 minutes on any topic. 
2 Clarification The rationale (see Rr.ere 1966) is explained, and the technique is 
explained in detail 
Technique #5: Controlling Free Writing 

1 Trainees are presented with a topic, picture, film or filmstr.p and given a specific 
writing assignment (i.e.. Compare and contrast these two pictures, Describe what 
happened in the film or hlmstrip, ) 

2 After the assignment is completed (5-10 minutes) the trainess read their composi- 
tions to the class 

3 The l>est compilation is written on the board 

4. The composition is analyzed for rhetoncai devices Trainees are asked to identify 
the rhetorical devices which are required by the task (sec Ramies 1976). 

5 Ramies ( 1976) is assigned for homework Trainees are asked to generate assir ments 
winch, because n* the topic and the assignment, control the ESL students' ting. 

Technique #6. Evaluating Writing Knupp's checklist. 

I. Trainees evaluate an actual ESL composition using Knapp's checklist ( Knapp 1972, 

trainees have been required to read Knapp's article before class — see SAMPLE 

COURSE OUTLINE, Scsskk, 10), 
2 Group discussion compare results, discuss rationale for selecting, marking errors, 

discuss future assignments for die u liter of the composition 

Synthesis and hvahiatior 

The discussion 'ftc each technique has been held to a minimum, the primary objective 
being to clarify the purpose t .nd implementation of the technique This final synthesis and 
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fvalttatam is intruded as an oppojiumt\ to »\p!oie the theorc Ik at implications ci| the tech- 
niques and to discuss ways in whuh the U i hniques might In- integrated into the curriculum 
trainees are em mi raft d to rdate the techniques to the assigned readings and to eval- 
uate the appropriateness of cadi tn lnii'jiie to their paitn uTtr situation '1 he trainer acts 
as a facilitator, guiding tlie discission (as ninth as possible ) to the following points 

1 I he plate of haudw ritmg uistrm tion in the KSI. i iirneuliun 'I he differences be- 
tween handwriting needs of duldrm and adults, between literate and illiterate 
learners, printing first \s uusive first or vice \ersa, handwritrig proliiems of specific 
language groups (Arabic, for example), etc 

2 The theoreh<al for - Lthons of controlled \ersns fre» composition instruction. 
Strengths and weiknt-ssis of both techniques Integrating both ti dumpies 'I he stu- 
dent journal 

J The need for helping l'M t students achieve indepi ndent e at tin* paragraph and 
composition le\el of writing as well as at tin* sentem* level 1 iw need for KSL 
teachers to understand tie- requirements of Knghsh discourse, and to eonsistently 
shift liaek and forth between controlled and fn omposition assignments 

I Tedmiques for integrating reading and writing l tudmn I he pros and eons of 
using models in teaching V.%\, writing 
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The Social and Linguistic Behavior of Good 
Language Learners 

R. W. Cathead, M. A. Strong & I W. Fillmore 

University of California, Berkeley 

This paper reports on the individual differences in English language acquisition 
of 30 immigrant children after one year of exposure to the se ond language 
We focus on the process of language learning as evidenced l>y the children's 
behavior in their classrooms anil while at play The relationships of several 
aspects of cognitive style and personality to the social and linguistic hehaviors 
of the children are also discussed. 

An important aspect of the study is that the subjects' communicative com- 
petence is assessed not onlv by standard elicitation techniques (repetition, story- 
telling and interviews), but also from numerous observations of their participa- 
tion in regular school activities These observations allow us to show the relative 
usefulness of different language sampling techniques, emphasizing in particular 
the difference between elicited language and that collected from the learner's 
environment. In addition, we suggest possible implications of the research for 
teachers of second languages and those who have second language learners in 
their classes. 

Criminals, cultists, and slow language learners have one thing in common, 
their aberrations are usually explained in the same two ways. Either the system 
is to blame, or the individual is deficient. Luckily for teachers, they are more 
often faced with the slow language learners, but unfortunately they have yet 
to be shown how to determine whether the system or the individual is at fault. 

There is, however, a third alternative wLch is based on the belief that 
different individuals have different learning styles. Thus slow language learners 
are not deficient in their abilities to learn, but they exhibit characteristics which 
inhibit their learning in the particular situation in which they find themselves. 
The problem for teachers is first to identify the individuals' characteristics and 
then to guide them towards the most suitable learning program. This paper 
will summarize some recent reseaich, in particular our own. on learner char- 
acteristics and the assessment of their language, and will focus on the implica- 
tions of this research foi teachers. We make the distinction between social styles, 
which encompass certain personality characteiistics, and cognitive styles, which 
determine approaches to problem-solving 

The forerunners to this research wen Carroll (1963). who looked at lan- 
guage aptitude, and Gardner and Lambert (1972), who examined attitudes and 
motivation, Research on the relationship of cognitive styles to language learning 
is still relarivHy rece' • Nainian, Frolich and Stem (1975) included certain 
cognitive styles among a battel y of independent variables in their study of the 
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good language learner. They totmd that field-independence was related to two 
of their language measures, imitation and listening comprehension, and that 
tolerance of ambiguity was related to listening compiehension Their subjects 
were junior-lngh and high school students. Fillmore (1976) conducted a longi- 
tudinal study of five Spanish-speaking children learning English in an American 
elementary school Her observations indicated that several traits appeared to 
promote language >earnmg. These were: sociability, preference for play situ- 
ations which required language, Hexdnlih. and a straight-forward approach to 
problem-solving. 

Research on personality factors has yielded contradictory results, partly 
because the age ol the subjects and the language learning environment have 
varied widely from study to study. The factors which researchers have studied 
include sociability , extraversion. introversion, empath), anxiety, ego-permeabil- 
ity and self-esteem In our study\{ reported last year at the Second Language 
Research Forum m Los Angeles), we examined four cognitive styles: field- 
independence, toleiance nt ambiguity and breadth of categorization, and four 
social st>!es outgoingness, si lf-e.steeni, submissiveness and egocentrism. 

The subjects were* twenty children, twelve kindergartners and eight first/ 
second gradeis, of mixed language backgiounds They were given standardized 
and adapted cognitive style tests, they were observed in different settings, and 
the teachers rated their personality characteristics. Language acquisition was 
measured from a series of ehcitation tasks and observation notes. The elicitation 
proceduies included imitation and storytelling tasks, and a 20-nrinutc interview. 
The billowing measures were used in the analyses errors in imitation, breadth 
of vocabulary in storytelling and fie' conversation, and structural complexity 
fiom the inteivievv. Additional examination was made of the form and function 
ol the spontaneous classroom utterances, 

Statistical anaKsis of the ian\" oiciers on the cognitive and social style 
measures and language s<<>ies produced the results seen hi Table 1. The sig- 
nificant associations tor the fiist second graders were between outgoingness I 
(as measured from teaehei ratings) and structural knowledge, outgoingness II 
(as measured Irom the observational data) and all language measures; and 
adaptive flexibility (measured \y path-finding task) and all three language 
measures Foi kiudergartneis, there were significant associations between adap- 
tive Hexibihtv and stiuetural knowledge, and between field-independence and 
vocabulan breadth 

Sepaiate statistical analyses weie tun on the two groups ol subject* for 
two reasons First the class stiuetures at the kindei garten level were so dif- 
ferent horn those (.1 other giades that me extent of poi missable student inter- 
action vaned considerably theieby confounding observations on outgoingness. 
across giades Secondlv raw scores on the adapted tests are .iot ( oinparable 
across such wide age langes. 

The lesnlts do seem to indicate diffeiences aeioss the two levels. Although 
no precise conclusion can be diawn I mm results based on so small a sample, 
tentative but worthwhile mleiciues can be made, which may fonn the by- 
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TABLE 1 

Kendall s Tau Values of Association Between Three Language Measures 
and Nine Persona! Style Variables. 



Or H 
N = 8 



language 
Measures 

Struct 

Knowledge 

Imitation 

Vocab 

Breadth 



CogmtiVf 
I 2 



• Style 



Personal Styles 



0 04 
-0.21 



0 25 
0 07 



0 71' 0.40 
0.75 # 0.36 



-0 14 0 21 0 75* 0 30 



Socidl Styles 

6 7" 



0 71* 0 89* -0 43 
0.52 0.7 1 • -0 62 



-0.24 
-0 24 



0.24 
0.24 



0 52 0 71* -0 62 -0.24 0 43 



Kg 

N = 



Struct 

Knowledge 

Imitation 

Vocal) 

Breadth 



0 35 -0.23 0.4 1 • 0.07 0.38 -0.02 0 18 
0.26 -0 26 0 02 0 18 1 -0 0.3 -0.15 0.06 



-0 18 0.08 
0.06 -0.17 



1 = Field ln<If»|M>ntlen< e 

15 z= Tolerance of Ambiguity 

3 = AfUptiVf F!exibilit> 

t = Breadth of Categorization 



043* -0 24 -0 02 022; 0 36 0 15 -021 0.02 0 30 



"> — OutgmngnosH I 
f> = OutgoUgness II 
7 — Self esteem 

H — HubfujiwiveneKx 
0 — Kgoo*«ntnsni 



potheses for our future studies. It would appear that good language learners at 
the first and second grade level are outgoing and flexible. The inconsistent 
results for kindergartners may he partially explained by the well-known un- 
reliability of trait-measurement in such young children. 

On the other hand, it is interesting to look at the results which were sig^ 
nificant for kindergartners One possible interpretation is that different personal 
styles influence different kinds of acquisition at the kindeiir.u ten level. Thus it 
seems that a kindcrgartnet who is flexible will more quickly acquire structural 
knowledge, and one who is field-independent will use more novel vocabulary. 
This is logical since the disembedding skills associated with field-independence 
may relate to the ability to isolate and identify single words, while flexibility, 
or the ability to generate novel solutions to a problem, could well be related 
to the creation of new patterns of words While this explanation seems plausible, 
it remains to be established win the same findings were not obtained for the 
older children. 

One possible source of the diffeiences between grade levels might befthe 
activity structures of the classrooms themselves, in the kindergarten classrooms 
main of the activities were group projects, where noise level was higher /and 
children were permitted to interact more freely. On the other hand, in the first 
and second grade classiooms much or the work was individualized and children 
were expected to function more on their own. The heterogeneity of the tasks 
encouraged less cooperation and thus less interaction. In this type of environ- 
ment there is more responsibility on the individual to create interactive situ- 
ations for herself, which makes outgoingness an essential finality if the child is 
to obtain plenty of language input from her peers. 

Analysis of the language samples revealed two important factors for con- 
sideration. The first was that individuals frequently performed differently on 
different skills. That is. no two rank orders of language measures correlated for 
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the kindcrgartners. Thus a child's score cm one measure may not be an accurate 
representation of her overall skill. Tins is illustrated by the rank order of two 
subjects, seen in Table 2. It these two children had been tested only on imita- 
tion, thev would both have appeared good language learners. If they had been 
tested only on storytelling, they would have been judged poor language learn- 
ers. However, their ranks on structural complexity give yet a third picture. 
Imelda appears proficient while Susanna is ranked the lowest of the 12 kinder- 
garten subjects Furthermore, ESL teachers rated Imelda one of the most ad- 

TABLE 2 

K.tnk Orders for Two Subjects of Three Language Measures 

Imelda Susana 



Measure 

Imitation (errors) 
Story (vocabulary) 
Inter* lew (structure) 
N — 12 



3 1 
10 12 
2 12 



vanced ESL learners at her school, and Susanna was ranged 15th our of 20 at 
hers. This gives rise to the interesting possibility that children's learning and 
interpersonal stvles might affect language acquisition, not necessarily in a gen- 
eral way, hut diffeientially so that the learning of some skills may he> enhanced 
and of others retarded hv the same hehavioral or cognitive qualities 

The second factoi tor consideration was the qualitative difference between 
the natural language samples and the elicited language. Vocabulary breadth 
corresponded in both samples, but the ioini and function of the language varied 
so greatly that other comparisons were difficult. A typical example of the dif- 
ference between the story, the interview, and spontaneous language is given in 
the Appendix 

It is important here to consider the influence of context on language. Fill- 
more (1976) has pointed out that in the early stages of second language acquisi- 
tion, children will get themselves into contexts, or situations, which involve them 
in language. She formulated these social strategies, among others: 

1 Join a group and act as if you understand what's going on, e\en if ><>u don t, 

2 Cur the impression, with a few welkhosen words, that you can speak the 
language. 

(pp. 667-669) 

It is not stupiiMug that 'he language which results from a social context where 
these strategies are employed is very diderent from the language used in a 
regulated context where an expeinnentei is saying: "tell a stoiy", or "answer 
these questions" 

The spontaneous language data icvealed interesting individual differences 
related to language function In general 63V of the spontaneous language con- 
sisted of directives of some kind Table 3 provides a breakdown of these di- 
rectives for the kindeigaitcn ehildien. It can be seen that simple imperatives 
(e.g "give me that!") comprise about 271 of the language non-imperative forms 
(such as "I need a led" oi that one's mine") make up anothei 20% and atte; - 
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TABLE 3 

Functions of Classroom Language for Kindergarten Subjects 



Subject 



Cho 

David 

Manta 



Ana Lucia 

Alberto 

Sabina 

jo Sun 

Elena 

Juan 



Iinelda 
Susana 
Raymon 



Imperative 

5 
2 
5 
4 
J 
I 

" 2 
1 



23 



Non-imp. 
direct 



Attention Dispute Self-verb 



19 



1 
4 
1 

18 



1 ! 
I ! 

4 'Other = 3 



lft 



17 



T = 95 



tion-calling-devices (or ellipted directives such as "hey''" or "Joe!") comprise a 
further 19$. An additional 172 is language involved in some sort of dispute (such 
as "I'm not supposed to . . or "teacher said so"). Of the remainder, most lan- 
guage consisted of self-verbalizations (17£). 

The individual differences in usage seem to be related to stage of language 
acquisition proficiency. At the earliest stage the child seems to use attention- 
calling devices regardless of their appropriateness. One child, in an attempt to 
engage his neighbors in conversation, resorted to pointing and (ailing attention 
to objects around the room. His efforts were unrewarded on that occasion, and 
he was generally ignored. At the intermediate stage similar language is em- 
ployed, but this time more selectively and appropriate to the context. We ob- 
served this from a child who was trying to take part in an ongoing discussion 
at her table. She nodded and smiled, and when one child punched another, she 
punched the first. When the conversation turned to broken crayons, she shouted 
argumentatively, "Me jiot broken*, while holding up one of her crayons. At the 
more advanced stage many more indirect directives are used, and the children 
seem to be able to use language according to the demands of the task. In one 
example, the subject was doing a puzzle with another child and gave him a 
string of indirect orders of the type: 'that belongs to the be"" and "that don't 
goes here". Seventy- two percent of such indirect diiections from the kinder- 
garten group came from the two best language learners. A possible explanation 
for these' apparent stages of language use might be that they are related to 
personal styles. Another study is needed in order to establish whether all learn- 
ers go through each stage, oi whether the faster learners begin at stage two or 
three. 

The implications of this research for touchers arise from two separate issues. 
The first concerns the application of. the idea that social and cognitive styles 
affect rate of language acq it ion, and the second concerns the definition and 
assessment of language competence. 

For example, if we accept that outgoingness is a desirable quality for lan- 
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guage learners, the teacher has two possible courses of action for promoting 
language learning toi the less outgoing children S he might tr\ to encourage 
peer interaction that the children have difficult) initiating on their ^^j^ by 
assigning them more group tisks. In KSI, classes this might invoke the Enlist- 
ment of nati\e speaker> as coworkeis on language learning projects. Alterna- 
tively, the teacher might try to match the instructional method with the chil- 
dren's individual styles, providing the resources are available Thus, the less 
outgoing child might prosper in the structured environment of KSI, classrooms, 
where there are no demands on her to interact with her Knghsh-spcakmg peers. 
The outgoing students may do better in immersion or regular classrooms. 

For the childien who lack flexibility, the teacher again has, logically, two 
approaches. The first would be to provide language tasks which train the stu- 
dent to be flexible such as finding multiple meanings for words or expressing 
the same ideas with different constructions. The second method would involve 
concentrating on pattern examples for these children while providing more 
open-ended types of exercise for the more flexible learners Of these two pos- 
sibilities, it would seern that adapting the program is more plausible than trying 
to change a child's cognitive style. 

The implications for language assessment involve, first, the danger of meas- 
uring only one skill. The example referred to in Table 2 .shows how individuals 
may var> across skills. If these two children had be^n assessed from perfor- 
mance on a storytelling task, as is done in many California school districts, 
Imelda might have been placed in a beginning ESL class, although her other 
skills were much further developed. Assessment from an imitation task or any 
error analysis would have ascribed greater proficiency to Susanna than she 
merited front her performance on all other kinds of tasks. 

Secondly, to measure competence from any relative I \ limited impressions 
of spontareous language may be misleading. As Fillmore (1976) pointed out, 
and as we have noted in this study, a common and effective social strategy for 
language learners is to join a group and pretend to know what is going on, 
preferably using a few carefully chosen language formulas. Listeners are fre- 
quently mislead into believing that if an utterance is correct and appropriate, 
the speaker must ha\e sue cssfully learned the words, learned the "grammar", 
learned the intonation, and figured out when to put them all together. Thus d 
shout of. "Hey, gunine one of those", may give a false impression of linguistic 
sophistication, where it merely reflects a careful attention to context ^nd ap- 
propriateness. As a result of this strategy, the child may know and use a great 
many of these 1 formulas appropriately, without being able to construct impera- 
tives or prepositional phrases in an interview, or even without being able to 
repeat similar phrases in an imitation task. We need to consider, then, for what 
purpose we are measuring language competence befoie we choose how to 
measure it. 

At this point it should be made clear that our research is still at too* early 
a stage for us to leach conclusions Implications for teachers should be stated 
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only with the qualification that further hypotheses need to he tested before the 
results ma\ he consideicd firm Howevei, we aie an\ion. that practitioners 
should he constantly aware of the \v«uk that is being done hy lesearehers. and 
we feel strongly that feedback hom the teachers is crucial tor planning future 
studies 

To conclude, we believe that mdiuduab difler in their cognitive and social 
skills in such a \\a\ that no one language learning situation is suitable to every- 
one. Individuals diliei in then ahihtv to get the lequned exposure to the lan- 
guage, and in the amount of help thc*y need from others in order to learn. If 
the language learning environment [ of which instiuctional piograins are types) 
is compatible with the needs of the learner, then learning is facilitated. If there 
is a mismatch, then learning will be impeded. At present, the instructional 
needs of non English-speaking students are 'lefined by the fact that they do 
not speak English, rather than anything more subtle Thus, students are simply 
placed in whatever kind of special program is available or being tried out at 
the moment. Ml programs work tor some students bet no programs work for 
all. This means that the attributes of ^ach individual must be assessed and 
catered to If we succeed in identifying these attributes, then at least we will 
be in a position to help solve the problem of slow language le^ners, while 
happily leaving the problem of criminals and cultists to others 

APPENDIX 

1 Liiguagt< Samples inr One Subject ( Cho ) from two hliutahon Measure and One Clavsroom 
Ohspr\ ahon 

Storytelling task * 

wim a cai tmnv, to go to ♦omewheie son >ys is \\ t is ruling a bike and that 

man mad u that bo\ n when that man go faster ,tn that bo\ is flyng awa> as he 

fell down her hike nroke an he got fi\ it « th"n v fix it then lu-r car \va« broken 
n lit' can't fix it n thru he tr\ to h\ 

Intvrx tvu 

Interviewer All right i an \ou tell me what * happening 

Clio. It's night 

Int. Uh hnh Its night and whos sleeping? 

Cho Witch 

Int Uh huh 

C!ho, Jus witch sh<M-s 

Int Uh huh the n itch's shoes OK so the witch is sleeping ami when 

Cho H t mt cle broom on di 

Int In the eorne*\ In h >eh, so she's got her bioom in the corner, ami where are her shoes? 

Cho. Down 

Int. Down, yell, under the bed and when* has she got hei ha 

Cho. Under de, up de bed 

!nt Or the bed, right, 

Classroom observation idotnfi a puzzle) 

Directives, no, the face (put the face in) Dispute I told you 
take it yes, I can 

sJ °P that cu/ I wanna ph. 

see, see 
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Non-Imp. Directives, that one belongs to hun 
this belongs to the bee 
that don't goes there 
no, no 

that don't fit 

let's do this one again 



Self verb: 1 am 

bird goes here 
dere 

and dis one goes here 
I'm done 
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On Speaking Politely in a Second Language 

Robin Scarcella 

University of Southern California 

An important task in acquiring a second language is learning the rules and 
mechanisms underlying appropriate use. However, relatively fevP researchers 
have been able to identify the specific aspects of communicative competence 
which our students need to know to communicate effectively. Even fewer stud- 
ies have examined the problems which our students encounter when using 
sociolinguistic rules. In light of this gap, our study focuses on one area of socio- 
linguisiics rarely, if ever, investigated, that of pragmatic competence. Our 
objectives are threefold: (1) to investigate the politeness strategics employed 
by both LI and L2 speakers when addressing speakers upward in rank, of 
equal rank and downward m rank* (2) to identify the politeness strategies 
which adult L2 performers have difficulty acquiring; and (3) to provide prac- 
tical suggestions for teaching politeness strategies. 

The subjects of our study were adult ESL learners (N ^20) who we.e 
aL'ked to participate in role-play situations specifically designed to reveal the 
subjects'* use of politeness features. (Native English speakers served as the con- 
trol group.) The situations were video-taped, transcribed, and analyzed in terms 
of those politeness features suggested by Brown and Levinson H974) Tbe re- 
sults, arc significant in that they suggest the specific areas of politeness which 
deserve greater emphasis in the ESL class. 

One of the most important tasks in acquiring a language is learning the 
rules and mechanisms which underlie appropriate use. (See, for example, 
Hymes 1967, Grimshaw 1973, and Taylor and Wolfson, 1978.) Paulston (1974), 
in fact, contends that learning the appropriate forms of social usage of a lan- 
guage is just as important as learning the linguistic forms. As^Iymes (1,967) 
argues, communicative competence deludes not only the linguistic forms of a 
language, but also a knowledge of when, how, and to whom it is important to 
use these forms. 

While many .econd language researchers and pedagogues have emphasized 
the importance of communicative competence in teaching English as a second 
language, (for example, OUer, 1973b; Rivers, 1972; Wilkins, 1976; and Halliday, 
Mcintosh, and Strevens, 1964), relatively few have been able to identify the 
specific components of communicative competence which our, students need 
to know to communicate effectively. Even fewer studies have examined the 
'problems which our students encounter when using sociolinguistic rules. In 
deed there has been very little research on trie acquisition of the sociolinguistic-" 
interactional rules which might govern communicative competence. (But, see 
Dunkel, 1978, Schmidt, 1978. for some current inV^jgation.) Perhaps this is be- 
cause we in second language have only quite recently turned to bojh socio- 
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and psycho-lmginsties when ( haiacten/uig the language acquisition process, 
(Hatch, 1978, Hakuta and Cancmo, 1977) 

In light of this gap, our study focuses on one area ot sociodmguistics 
rarely, if ever, -investigated for the purposes of second language jesearch* that 
of pragmatic competence More specificalK. we (wis on the use. or perhaps 
misuse, of politeness features in adult fiist and second language performance. 
Although much of the literature, eenteis on what native speakers do m their 
o; n language, (Lakotf, 1973, Brown, t975. Brown and Lcujison, 1974 Fergu- 
son, 1976, and Shunaiiofr. 1977 i ; re|ati\el\ few studies have'been icported on . 
the ac(|Uisiti(m of politeness featuies h> secoiifl language learners. 

Generally speaking, politeness use is motivated In a speakers desire tq^_ 
save face, 1 her/his own face, or too heaicr's face. Acconling to Brown and 
Levinson (1974), there are two- types of politeness- positive politeness which 
is uS^d to satisfy the speakers' needs foi approval and belonging, and negative 
politeness which functions to nnmuii/e the imposition of the fiice-thrcateinng 
act. These politeness types inav \ ary .according to the nature of the a. itself 
and status which we define here in terms of the social distance and relative- 
power of the speaker and hearer. 

In this paper, we examine the eflect of status on pohteness m adult second 
language performance. In so doing, we describe and analyze the politeness 
strategies employed by native (LI) and non-native J \:2) speakers when ad- 
dressing speakers upward in rank, of equal iauk, and downward in lank, (here- 
after referred to as superior, equal familiar, and siiboidinate,'.iespcctr*ely ). 

Our objectives are twofold 1) to inNcvtigate the politeness stiategies used 
by adult LI and L2 speakers, and 2) to identify those str^vgies which adult 
L2 performers have difficulty utquuing In ordei 10 achu\c T^iese goals, we 
undertook the small stud\ declined 1 . low 

1. Method 

Subjects 20 adult L2 performers (10 advanced and 10 begun.., ig) J par- 
ticipated in our study. The subjects wen 4 male, spoke Arabic as a fust lan- 
guage, and ranged m age from 20 to 27 Oui control group, 1 consisting of 6 
native English-speaking males, langing in age honi IS to 26, enabled us to 
compart\the speech ot adult LI and L2 spedkeis 

/V/sA v ^^snbjects weie asked to participate '«i three iole-pla\ situations 
These situa^ons were specificalb <t< sign<d to ie\.;al the subjects* use of polite- 
ness features when speaking to snpeimis, equal familiar, and suboidi nates'. 
The situation is described bfclow, 

} Brown and Levinson ( 1971; tli fine fate a. ,t s< t of wants salifiable onK In the actions 
(■HicludinK expre* • ../ns of wants) of others 

• \jV\v\ Has determined I" sions mi tin- Ann ru an I tumiaue Institnli (I SC) I'Lueim-tit 
Kxam, Spring, 1978 / 

3 The subject* of the control uroiip were nniwrsity stud< nts who forimd pa)t of the USO 
debate team. 
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You art- planning ,tti tiffin- part> You mwte ><mr Im>ss, the dirk who works under 
yon, and your good friend, a fellow employee You request that each tif \our guests 
coeie unaccompanied by ,,js wife 

*Th"re wen- many advantages in iism^ role-play. First, it allowed us to 
obtain -complete eonveVsutional interactions, containing both conversational 
openings and closings Second, it enabled us to control the conversation to a 
certain extent. This in turn provided ns with comparable samples of speech. 
Third, it facilitated videotaping, important when examining paralmguistic fea- 
tures of discourse. Fourth, we found, as did Andersen (1977), that role-play 
speech gapped an awareness of language appropriateness However, a word 
of caution seems necessary here. T -%\ investigators have .suggested that role- 
play speech is not representative of natural language. (See, for example, 
Mitchell Keman and Kernau, 1977, and Littlewood, 1975, for role-play in adult 
1,2 performance.) Thus, while role-play speech may reflect ones ability to 
vary language according to the soeiolmguistic context, it probably does not 
indicate what one actually says or does in m,*re relaxed, natural situations. 

Data Analysis Each of the 78 speech samples was transcribed by the two 
researchers. The samples were then analyzed in temis of those features of posi- 
tive and negative politeness suggested by Brown and Levmson ( 1974), (Meas- 
ures of politeness were decided upon in advance r^ the two researchers. See 
appendix for exact definitions ) 

2. Discussion of findings 

In tlii > section we examine positive and negative politeness in LI and L2 
performance. Comparisons of the use of politeness features by beginning and 
advanced L2 performers and native English speakers are made whenever pos- 
sible. 

Positive Politerwss. In positive politeness, Brown (1975) state?, "H (the 
hearer) is treated as a member of an hi-group, a friend, a person whose per- 
sonality traits are known and liked." (p. 6). Common ways or achieving posi- 
tive politeness include, 1) expressing mi interest in and nothing H, 2) using 
in-group language (for example, address terms, slang expressions such as "ya 
know", ellipsis, inclusive "we", and piosodic and kine.sie features, 3) making 
small talk, and 4) showing and seeking agreement (Brown and Leviuson, 1974). 

1. Expressing on interest in and noticing hi One of the principal ireans 
of expressing politeness is by showing interest in and noticing H. This can be 
done through greetings. In our data, most of the conversajjifflts, (75 78 or 95$) 
began with greetings Only in the beginning (low level) enactments were greet- 
ings absent. Indeed, we suspect that politeness formulas suclnas gujtings may 
be one of the earliest means of demonstrating politeness in a s< eond language. 
K)ur data do, however, indicate that even advanced (high level) L2 performers 
have difficulty using appropriate greetings,* 

4 To facilitate identification of the data, we have labelled our examples TI.-» following 
abbrevifttioas are used. SS stands f«.r Nati\e Speakei, ///, stands for High Level L 2 Speaker 
and ZX a, used for Lou Level L, speaker B is for boss, E is for employee, and C is for clerk] 
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„ il) K. Hello 

F: Hell- Welcome. 

(high Lev! L2 'HI, J Situation 2.Gioup I ) 

2. ('*ing "in-t'soiii>" lan&iaze. In-group language stresses speaker involve- 
ment*with the ruarer. It is here where we find striking differences between 
LI and L2 speakers. 

One means J emphasizing irler Jliness and sondarih is through the use 
or in-group address terms, (for ex* aple, first names) and endearments (for 
example, pal. huddtf. honey) (Brown and Levisofi. 1974). Although the L2 
speakers used endearments to stress speaker involvement and minimize imposi- 
tion (See example (2) below), the LI speakers did not. 

(2) E. Bot friend, I don't want you to bring your wife 

(Low level L2 (LL):2:7) 

Slang, another marker of in-group language was frequent, at least in the 
LI data, a.id was used particularly with equal familiars by the native speakers. 
The non-native speakers useu slang randomly, th-it is. in all three role-plav 
enactments. 

TABLt L Slnitf 

I'mvntaxe' of shing'" directed tti b'.,\\ (superior) iruud (equal familial J 
and clerk ( stilx>rdinaJe )' 

Subjects S To boss To friend To clerk 

Native speakers 2.3'i <18) v I IV (32) 35* 1 27 ) ' 

High Level L2 4OT (2) 20V (1) 40V (2) 

Low leve l 12 

■Tli* i>mentag* reprint* the it tint nf slantf used m ea»li addresser 

'•HLii* e-.preaaimw, counted N>|wratrly by the t*o reaearrhers, were defined ax « *P""i«"< «f fil 1 !??? 

!ftiik'iia*e outside of Ktamlan! lai.KnaKe. (e»r. My" ' (i<>t if ami * »*. Heal with that ) 
' The „nimil»rr o." octurrenres »f slang expressions i* in pumMrtews 

Expiessions such as r/rt fcnmu, ./ mean, and you understand, which make 
the addressee a more active participant in the conversation, are also considered 
j L >art of in group language ( Lakoff, 1973). Such expressions appeared fre- 
quently in the LI data, less frequently in the high level L2 data, and infre- 
quently in the' low level L2 data. The L2 speakers did not always use these 
expressions aj ropriajely. 

(3) ti: Will you almost uh make my home as your home you know. 

':HL:2:2) 

Ellipsis (here defined as the omission of a subject and or verb in an 
utterance), is another marker of in-group language. Ellipsis was present Li the 
LI data*Mn all- contexts. The L2 speakers used less ellipsis, and the t, oe of 
ellipsis which they emplo>e<ftfiiFered fiom that used 1>\ the LI speakers. For 
example, the ellipsis most ' frequently used by the L2 speakers consisted of 
phrases used m response to a question. 

. 2,! 
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(4) B: At what time is the party? 
E: About 7.30 

(HL.1:3) 

The absence of ellipsis in tlie L2 data often made the speech seem overly 
formal. Consider, for instance, examples (5) and (6): 

(5) E: Excuse me sir I am sorry to interrupt you at this time 

(HL:1.4) 

(6) C: What about the address 

E: The address is 2nd and Vermont 
(LL:3,10) 

Inclusive "we", the we which includes both the speaker and the hearer, 
in the conversation, also occurred frequently in the LI data, mainly with the 
superior and equal familiar, but also, to some extent, with the subordinate. In 
contrast, inclusive Ve was absent in the data of the L2 speakers. This finding 
is significant in that it indicates that inclusive toe may be acquired quite late 
in adult L2 acquisition 

* 

Falile 2 Inclusive "We" 
Percentage and min.ber of occurrences of inclusive "we" used w.th each addressee 

Subjects % Tolxiss To friend To clerk 

Native speakers 38? <8) 43* (!)) 19* (4) 

,High level L2 ■ — 

' Low level L2- ' — — 

* Incluaive inrluden both the xpmker and he*rer mi thf conversation, (e g '! don't under 

ntand utmt we'rr icom* to do " r ' np'll 1# out of there h> 12 00) 

In-group language can also be characterized by its prosodic features, (for 
example, volume and rate). We found that LI speakers spoke faster to the 
friend than the superior or subordinate. (See Table Three below.) Thus our 
data are consistent with the findings of Brown and Levinson (1974). Not sur- 
prisingly, the low level L2 speakers, unlike the high level L2 speakers, were 
unable to use fast speech to convey friendliness. More observationally, we found 
that both LI and L2 speakers spoke louder when speaking to subordinates or 
superiors. And, we noted that generally the high level L2 speakers spoke much 

TABLK 3 Speech Hate 

Words per minute* used h> LI and hstfh and low level \fl speakers 
when addressing speakers of different statuses 

Subjects To boss To friend To clerk 

Native speakers 162 174 156 

High fod L2 120 162 144 

Uvv level 1.2 144 120 120 

*S|»rfrh r»if uftN meantirrd in word* p«*r minute, baaed on tlir firM ormrrrnce of fWnt 
*I>ee«h in d( h tainple (All full word* but not word f mtciiirnix were Mmntrd ) We denned 
it word an a Htandard orthojfrnpfnr mot Whilr prniwr tiftiiien Ami exi Initiator* ?xpr*Kftion* 
counted as one word, concatenate tortus *uch its "kind*" were counted n* two word* 
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louder than either the ' ^\ Ie\el fS? speakers 01 native speakers, (at times, in 
fact, almost shouting). 

Perhaps the most salient maikeis of m-group language are kinesie. Beth 
LI and 12 speakers used bod\ gestures wlneli eim \asi/ed solidarity. When 
speaking to an equal familiar for example, the spe tkers often leaned forward 
stood elose. and fated their eonveisationa! partner squarely In addition, there 
was frequent smiling and laughing, head nodding of approxal, touching (cspe- 
eially hand shaking and shoulder slapping at the onset of the interaction), and 
outward hand and arm movements When speaking to a superior, on the other 
hand, there was more distancing, less laughter, inward h nd and arm move- 
ments (including slumped shoulders), and downward eye gaze. When com- 
paring the LI and 1,2 speakers, we ohser\cd that the L2 speakers used moie 
outward hand and arm movements, more touching, (especially hand-shaking), 
le«s distance and more direct eye gaze. * 'le these kmesic features may in- 
dicate friendliness and solidarity to Arabic speakers, English speakers may per- 
ceive them as over-friendly and even imposing. 

3. Making small talk One way to claim coi .on ground with an addressee 
and avoid abruptness is by making small talk. In such a way, the speaker 
enhances the hearer's positive face. In general, the speech dat i to friend was 
marked by the presence of small talk in all speech samples. There were, never- 
theless, distinct differences in the use of extended openings and closings by 
LI and L2 speakers. (We include openings and- closings in this discussion 
since, like small talk, they function to eliminate abruptness in a conversation.) 
When speaking to a superior, we observed, the LI speakers only used short 
openings, such as attention -getter* 

(7) E: Mr. Jones Mr Jones. 
B: Yeah. 

E: Uhm. I was wondering uh we're having a party. . . . 
(NS:1.2) 

Longer openings were rescued for equal familiars and subordinates. The L2 
speakers, particularly the low level ones, howe\er, often used extended open- 
ings in all contexts, as in (8). 

(8) E: Hi Saled. 
B: Hi 

E: How are you doing 
B. Fine just fine 
E: What's new. 

Uh I am very glad to see (to tell you) we have a party j 

(LL:1:9) 

Ferguson (1976), in discussing politeness formulas, notes that elaborate greet- 
ings are often used in Arabic. It may be that the extended openings used by 



On .Speaking Politely 



* 281 



the 1.2 performers result from a transfei of \rabic discourse-politeness strategies 
into English, V 

Closings also difiered among the three groups of subjects. The LI speakers 
used a variety of pre-t losings, (such as alright, okay, yeah, and fine), to politely 
indicate that the vonversation was ending The L2 speakers, on the other hand, 
did not. Only in a few cases, in fact, were the high level students able to use 
the single pre-elosmg, okay. Thus, our data suggest that okay may be the first 
pre-elosing acquired by some L2 performers. 

4. Being agreeable Positive politeness can also he achieved by being 
agreeable. Linguistic m irkers of this categoi . include repetitions of all or part 
of what a preceding soeaker has just said and positive back channel cues. 
(Here we define positive* back channel cues as markers of agreement such as 
yes, true, and really.) Repetition in this study was not context sensitive. That 
is, all of the 1.1 speakers employed repetition randomly, in all contexts. Repeti- 
tion appeared to a much lesser extent in the L2 data of the low level speakers, 
in addition to using repetition, the LI speakers used a large variety of positive 
back channel cues :n all contexts. Unlike Lakotf (1973), however, we did not 
find these markers se nsitive to the sociolingiustic context. The L° peakers also 
relied heavily on ih'se expressions. However, their hack channel cues consisted 
primarily of the expressions yeah and okay. And, they did not always know 
the co-oecurence rules involved in using these rules appiopriatcly. 
(9) F: You're my friend. 

E: Okay and please don't uh bring your children and your wife. 
' (HL:2:2) 

5 Seeking agreement Seeking agreement by using such question tags as 
right?, okay?, and alright? is also a part of positive politeness. Such question 
tags woe present in all contexts in the LI data, but lacking in the L2 data. 

Negative Politeness 

Having discussed positive politeness, we now turn to an examination of 
negative politeness. Underlying negative politeness, are strategies geared to- 
wards preventing infringement upon the hearer's freedom of action. They are 
central to deferential behavior when addressing those higher in rank and char- 
acteristic of social distancing behavior in general. Negative politeness strategies 
most cVnmonly found in our data were: 1) hedging; 2) indirectness; 3) im- 
personanzing, md 4) using deferential address terms. 

1. Hedging Hedges have a variety of surface manifestations. Among the 
most common are particles such as fust, maybe., kind of, and sort of, expressions 
Mich as / wonder i/, the thing is, something like that, and to a lesser extent 
verbs such as think and uiodals such as could. 

(10) E I kind of have to talk it over with everybody 

(NS:1:2) 

(11) E: Just a bunch of the guys getting together 

(NS-1:2) 
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(12) E. We were wondering if you wanted to come along 

(NS:1:1) 

(13) E: We are thinking about since most of the guys arc gonna be 

there we could invite you 
(NS:1:1) 

Overall, hedges were much more characteristic of 1,1 speech than of L2, as 
illustrated in Table Four below. ^ 

TABLK 4 Hedges 

Percentage and nurcuVr (if occurrences of Hedges used by Id and high and low level 
L2 speakers when addressing superior, equal familiar subordinate 



Subjects To boss To friend To clerk 

Native speakers 51% (50) 20* (20) 27* (27) 

High level L2 52% (9) 5? ( 1 ) Ml (7) 

Lo w leve l L2 _ - 1% _JV __, m (l4) Mj? (11) 

In marked contrast. L2 speakers hedged to a very limited extent and low 
level students especially, did so indiscriminately. Hedges in L2 consisted mainly 
of just and rruiybe. Moreover, there were cases in which these hedges were 
used altogether inappropriately. 

(14) E: I think we have a party tonight. 

(HL:1:5) 

(15) E; There's maybe something you don't like because maybe you 

can't bring your wife 
(LL:1:7) 

Such uses of hedges are clearly inappropriate for the successful completion of 
acts. Consequently, in addition to lacking a complete repertoire of hedges, the. 
L2 speakers in our study lacked both a knowledge of their semantic function 
and distribution. 

2. Indirectness, If as Searle (1969) claims, "in directives, politeness is the 
chief motivation for indirectness", we might expect more indirectness occurring 
with superiors md subordinates than with equal familiars. The data in Table 
Five clearly indicate that in LI speech, at least, *his is the case. 

TABLK 5 Indirectness ' 

Subjects To boss To friend Toeleik 

Vativ e speakers AM ( 12 ) M ( <) ) 2 Vi ( 5 ) 

High level L2 33'/ (3) UV (3) " 33'/ (3) 

Low level L2 3fi'f (\) VH H) 27 ( 3 )' 

* ln'li!<*rtnr»N ih . <»n ^ir|«-i «mI not m«l>in, .ill nf Hi *|M*kn - 1 i In U in «l<-nrs t \\Au ■ • 
(Shmmtioff l'»?T |< _M") ItuIsm i Mm sh v^ns .nnii*. 'I n «H |'i nili fi'1. In tl< l«» mm im'ui 

The examples below illustrate the extent to which indirectness varied accord- 
ing to each situation. Example (16) was used with the superior, example (17) 
with the subordinate and example (IS) with the equal familiar. „ 
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(16) E: We were Wondering ifvou wanted to come along 

(NS:1:1) 

(17) E: Why don't you jus' come along 

(NS:3:,) 

(18) E: Come n have a beer party at our place 

(NS:2:1) 

What is most apparent in these examples is that LI speakers were more direct 
with the equal familiar than with either the superior or subordinate. Also, (17) 
difteit from (16) in that it contains an imbedded imperative ( Ervin -Tripp, 
1974) making it slightly less polite than (16). Contrary to Ervin-Tripps (1974) 
claim that hints are used more often to familiars, our findings indicated more 
hints being used to both superior and subordinates than equal familiars. Perhaps 
hints, more than any other directive type, vary not only with respect to the 
situation, but also with respect to task. That is, hints will tend to be used for 
those acts which most threaten face as, for example in (19) and (20) when 
they are used with the superior. 

(19) E: It's one of the guys kind of things 

(NS:1:2) 

' (20) E. Its one of those affairs where its just the guvs 

(NS:3:1) 

To equal familiars, on the other hand, imperatives seemed to be more common, 
as in (21). 

(21) E: Ya know you cant bring Joan right? 

(NS:2:2) 

In sharp contrast to LI speakers, L2 speakers confined themselves almost 
entirely to statements of personal desire ( Mitchell-Kernan and Kernan, W77). 

(22) E: I would like to invite you to a party. 

(HL:1:5) 

(23) E: I would like you to join us. 

(LL:2:3) 

(24) E. I want ycu to come in a party. 

(HL:3:1) 

We suggest that such expressions as / would tike may be, for at least some 
L2 acquirers, formulaic devices not reflective of the L2 speaker's grammatical 
competence. As with children's need" statements, these "statements of perse nal 
desire" appear to be one of the first directive types of emerge in second lan- 
guage acquisition. The fact that they were used indiscriminately by low level 
students seems to indicate that they were devoid )f any social (distributional) 
function. 

When using request forms, L2 speakers gei erally were much more direct. 

O 
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Simple declarative statements were often used by both high and low level 
students. 

(25) E. You can't bring your wife and your children to this party. 

(HL:1:1) 

(26) E: You can come alone. 

(ILL:3:4) 

Unlike LP-speakers who used these only to equal familiars, high level L2 
students used these with superiors. To both equal familial s and subordinates, 
however, they used simple imperatives. 

(27) E: Don't bring your wife and your children. 

(HL:1:1) 

The fact that high level students used simple imperatives mainly to equal 
familiars and subordinates, and declarative statements to subordinates, implies 
that they did attach some social meaning to these forms. 

In contrast, the low level students used imperatives invariably. This sup- 
ports our claim that in early stages of language acquisition, many adult L2 
performers have difficulty using syntactic means to convey politeness. 

We also observed that directives were often accompanied by other features 
marking politt^ss. We refer to these co-occurring features as "p res to direc- 
tives" since they functioned mainly to signal that what follows is a directive. 
Examples of these pres include please, I'm sorry, Excuse me, and performative 
utterances such as / tell you. 

(28) E: Please don't biing your wife. 

(HL:1:4) 

(29) E; I'm sorry you cant bring your wife. 

(HL:1:4) 

(30) E: I would like to tell you something else uh you cannot bring 

your wife with you 
(HL:1:4) 

The results of our analysis of pres to directives appear in Table Six. 

TABLE G. Pres to Direct i\es # 



Subjects Tohoss To friend To clerk 

~\at»\e" speakers 52V ill)'" 30V (7) 17V (4) 

High leu'! L2 42V, (12) 31? (12) 26V (10) 

Low 1e\el L2 30V (IB) 31V (14) 317 (14) 

" I'n's won' <lf>ftni*il i.tjn emwiiH Hirudin.: Hui*<ti\tH win 1i a. \AvA>t' I in von \ nml 
"I trll Mia 

ITf>rr \.e note that pres to directive?? were used more often by L2 speakers than 
by LI speakers. In fact, pres, especially performatives, appear almost exclu- 
sively, an L2 politeness strategy. It is our contention, therefore, that pres to 
directives: 1) compensated for the L2 speakers' inability to use other means 
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of indirectness, and 2) are one or the earliest politeness strategies acquired by 
adult L2 performers. 

3. Impersonalizing The use of passives and exclusive "we" lor the avoid- 
ance of *T and 'Sou" lunctiu.is to distance the speaker and hearer. 

(31) E: We're having like a little party at my place 

(NS:1:3) 

(32) E. No wives allowed. 
' . (NS:2:1) 

Most common among these iiiipersonah/iug mechanisms was the use of exclu- 
sive we. Passives, on the other hand, were \ery scant 1 . 

T\BI.L 7 KuIumm- "We" 

E'eicetitaKe Kind number ot ueenriences of exclusive 'we' used i>> LI and 
liitfli and low level 1.2 speakers to addressees (it \amn£ status 

Subjects I'oImiss To friend 'loclnlt 

\ah\e speakers U>Y (I2> 3t\ ( ( > ) iVi (5» 

Hi«h level L2 JJV ( J) itt ( } ) H'< ( 5) 

Low Level L2 ,jf« (4) 36V ( 4 ) 27* (.)) 

'Kxt|iiM\t is ilif ««• \\ lii> »i «s<|iii|es th. h»-arvi in tin «<ii\it\ Oi.h <U-mi (hm-s 

<tf « \i In - | v « \\ i \\ | h t if >J 1i 1 1 ' t 

The data presented in Fable Seven show that LI spe^keis employed this stra- 
tegy most often with the superior and least often with the subordinate. 

L2 speakers, on the other hand, rarely employed tins strategy. As a matter 
ot fact, most striking among the featuies distinguishing L2 from LI speech 
was the overwhelming use of the pronouns I and you. 

4. Using deferential address terms. Deferential address terms (for exam- 
ple, title + last name as in Mr. Jones and sir) are also characteristic of nega- 
tive politeness. While sir was used by both LI and L2 speakers, the L2 speakers 
did not always know the precise co-occurrence rules for its use. 

(33) E: Hi sir 

(HL:3:5) 

J. Summary 

To conclude, our analysis led us to some interesting findings. First, it 
suggests that while some politeness features appear to emerge quite early in 
adult second language acquisition, (for example, pres to directives such as 
sorry and please), others, (such as slang, ellipsis, and inclusive we) do not. 
See Table Eight below for a summary of *mr results. 

Second, the acquisition of politeness forms appears to precede the acquisi- 
tion of the sociolinguistic-interaetional rules and mechanisms underlying the 
use and distribution of these forms. ( Refer to Table Eight above. ) That is, adult 
1*2 performers seem to use politeness features before they have acquired their 
co-occurrence rules and appropriate distribution. 

Third, we would claim, based on the LI data, that the use of politeiif 
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TABLE 8 Summary of the results 
Acquisition (if Form 4 an J Distnlmtion 1 ' of Politeness Featuies 

Native speakers High level L2 Low level L2 

Foim Distribution Form Distribution Form Distribution 
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TABLE 9 

Summary of Results from Analysis of Positive and Negative Politeness Features 

Table 9- A. Positive Politeness* 

Percental * positive politeness features used by LI and high and low level L2 speakers 
to addressers of varying status 



Subjects To boss To friend To clerk 

" Native" speakers 28^39) " ~~ 37% (51) 33% ( 45 ) 

High level L2 36% (7) 31% (6) 31* (6) 

Low level L2 27% (2 ) _J?JL<^) 28% (2) 

* lias* d on » (.unbilled cotal of all positne politeness featittes 

Table 9-B Negative Politeness* 

Percentage of negative politeness features used by LI and high and low L2 speakers 
to addressees of varying status 



Subjects To boss To friend To clerk 

Nc7t7ve~s^^kers " 50% ~( 74 ) 25% ( 36 ) 25% ( 36 ) 

High level L2 39* (24) 26% ( 16) 34% (21) 

Low level L2 31% (27) 37% (32) 32% (28)_ 



* UaM-d on total number of negatne nobleness featuies 

features varies according to the status of the addressee, but that L2 performers 
are limited in both their range of politeness features and their capacity to vary 
their use according to the social context. As noted in Table Nine below, in the 
LI data, the total number of both positive and negative features varied across 
situations' relative to the status of the addressee. As expected, LI speakers used 
more positive politeness toward equal familiars and subordinates than to su- 
periors. Conversely, they used more negative politeness to superiors thaa to 
equal familiars or subordinates. High level L2 speakers, like native speakers, 
used more negative politeness towards superiors; however, unlike native speak- 
ers, they used more negative politeness toward subordinates. (Perhaps this re- 
fleets cultural differences in attitudes towards those lower in rank. It may be 
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the case that lower ranked individuals in this culture do not prompt social 
distancing as they might in other cultures.) Finally, low level L2 speakers 
showed much less variety in the politeness strategies they used and lacked any 
apparent distribution of politeness features. 

The results of this study, while preliminary, are very suggestive. In order 
to confirm and extend our findings, studies in progress examine more naturalistic 
data, data coming from larger number.? of L2 performers of diverse language 
backgrounds. The effect of first language .interference is being examined. In 
order to make stronger claims, inferential statistics are being used. Such in- 
vestigation should lead to a better understanding of the politeness features 
which adult L2 speakers need to know to communicate effectively as competent 
^speakers in their second language. 

x N definitions < * 

Ellipsis. Ohmsion of subject and/or verb not necessary for understanding of sentence. (EX- 
AMPLE: Talk to you later.) 

Exclusive tve. The we which excludes, the hearer in the activity. 

Expressiotis that make the addressee a mbre active participant. Expressions winch often func- 
tion as fillers such as»ya know, I mean, and I see. 

Hedge. A linguistic feature marking the absence of certainty. Included as hedges are such 
words as if, perhaps, maybe, hesitations and disfluencies. 

Inclusive we. The we which includes both the speaker and the hearer in the activity. 
Indirectness. Not making all of the speaker's beliefs or desires explicit, or the avoidance of I 
or you. (See Shimanoff, 1977, p. 239.) 

Positive hack channel cue. Marker of agreement which affirms the hearer's remarks. 
pre-sequence. Conversational device functioning as a preparer for and introducer to a con- 
versational sequence. (EXAMPLE. Are you doing anything tonight? Would you like to go 
to the movies?) 

Rate Measured in words per minute, based on the first occurrence of fluent speech in each 
data sample. (All full words, but no fragments were included.) 

Slang Informal language often considered outside of standard Usage. Slang expressions in- 
clude guy, What's up?, and Can ya deal with that? 

Question tags. One word tags having rising intonation which function as a means of seeking 
agreement. (EXAMPLES: You live on Adams, right?, I'll see you at eight o'clock, okay?) 
Word. A standard orthographic unit including proper names contractions, and excLmatory 
expressions, but not concatenate forms such as kinda which count as two words. 
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The Perception of Politeness in English and Spanish* 

Joel Walters 

University of Illinois 

This paper investigates the perception of politei.ess in English^by native speak- 
ers of American English and second language learners of 17 language back- 
grounds. It also presents data on the perception of Spanish by a group of 
Puerto Rican speakers of Spanish. Those groups judged speech act strategies 
for requesting (e.g., can, could, will, do, etc.) using the paired comparisons 
method. The findings showed high correlations in the perception of politeness 
between male and female native speakers and between native and non-native 
speakers of English. Greater unanimity and more categorical perception re- 
sulted from the non-native and female native speaker's judgments. In the 
Spanish request forms, politeness ori the part of male speakers could not be 
predicted from female perceptions or vice-versa. 

Most of the work in linguistics, anthropology, and sociology on the form 
and function of politeness in communication finds its roots in the writings of 
Goffman (1967). Goffman presents his views of deference in the context of a 
theory of action or behavior in general, not restricting himself to linguistic 
aspects ^ interaction or even to communication in general. He defines the 
concept of deference as "the appreciation an individual shows to another 
through avoidance or presentation rituals" (p. 77). % -appreciation, he means 
the value ascribed to an individual, e.g. in assigning high status or prestige to 
that person. By avoidance " rituals Goffman means those actions which allow 
* social distancing to be maintained, including the avoidance of certain topics 
of conversation. Among presentation rituals he includes salutations, invitations, 
compliments and requests. 

In perhaps the most elaborate extension of Coffman's thinking to linguistic 
aspects of politeness, Brown and Levinson (1978) present an intricate model 
and a lengthy taxonomy of politeness strategies with examples of linguistic 
(including pragmatic, lexical, syntactic and semantic) manifestations of those 
strategies from three very different languages: English, Tzeltal and Tamil. The** 
politeness strategies include: 

1. Notice the listeners interests, wants and needs 

2. Use in-group identity markers 

3. Seek agreement/avoid disagreement 

4. Offer, promise 

• The author wishes to thank Jennifer Bahowick for assistance and comments in prepar- 
ing the paper. 
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5. Give reasons 

6. Apologize 

The linguae examples they provide drawtwjjost heavily on speech acts and 
lexical info^gun. For example, in noticing the listener's wants, interests and 
needs, a strateg^ discussed is paying a compliment to the listener. Moreover, 
some of the politeness strategies actually represent names of speech acts (e.g. 
offers, promises and apologies). The authors conclude finally that "indirect 
speech acts have as their prime raison d'etre the politeness functions they 
perform" (p. 147). 

R. LakofFs (1973) rules of politeness seem to be based on some of the 
same assumptions as Brown and Levinsons' work. One of those assumption^ is 
that the relationship between the speaker and hearer governs the linguistic 
expression of politeness. Lakoffs rules for being polite dictate to the speaker 

how to act toward the hearer. They tell him/her: 

* 

Don*t impose 
Give options 

Make the listener feel good 

Lakoff mainly draws upon lexical strategies as examples for these rules. She 
cites words like elimination as the proper way to avoid imposing on the listener 
and the use of expressions like yknow and / mean as ways of making him/her 
feel good. 

Fraser (1975) defines politeness as a function which is based on the hearer's 
perception of an utterance. It can be categorized then as a perlocutionary effect 
according to Austin's (1962) tripartite division of speech acts. Fraser goes on, 
to discuss the use of various speech act forms, please and certain topics as 
Significant components in the hearers perceptions. 

For this research, the importance of syntactic, lexical, prosodic and dis- 
course information as well as a wide variety of non-verbal data (e.g. physical 
distance between speaker and listener, setting, eye gaze behavior, body move- 
ments) in the perception of politeness was recognized. However, the intent of 
the study was to investigate how much politeness could be squeezed out of 
speech act strategies alone. In other words, the principal research question was: 
How dc idults perceive the politeness contained in the speech act strategies of 
request forms? In that light, the following hypotheses were proposed: 

1. Males and females will differ in their perception of the politeness of the speech 
act strategies of English requests. 

2. Second language speakers of English will differ from native speakeis in their 
perception "of »he politeness of the speech act strategies of English requests. 

3. Male and female Puerto Ricans will differ m their perception of the politeness 
of the speech act strategies of Spanish requests. 



1. Method 

Subjects, Three groups of subjects participated in comparable experiments. 
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The first group consisted of sixty college students who were native speakers of 
American English. Half were male and half were female. The second group 
of subjects consisted of foreign university students of varying (17) first lan- 
guages. Approximately half were either Chinese or Korean, with the remaining 
half equally distributed among fifteen other mother tongues. Forty-five subjects 
were male and thirty were female. All had achieved a high enough score on 
the TOEFL to gain admittance to the university. They were enrolled, however, 
in a course in rhetoric, the most advanced level of training in English for non- 
native speakers. The third group was composed of ten Puerto Rican speakers 
of Spanish; five were males and five females. 

Materials. In the case of the native English speakers and foreign students, 
tne stimuli consisted of 14 generic request strategies which were equivalent to 
the semantic form. These were attached to a standardized lexical content to 
assist the subjects in making their judgments. (Table 1 provides a list of these 
strategies.) The origin of this particular set of request strategies is a corpus of 
256 requests produced by 32 Puerto Rican children in a role-playing experiment 
with puppets. The 14 request strategies in Table 1 accounted for more than 97 
percent of the corpus. Other data gathered through similar experiments with 

TABLE 1 

Generic Request Strategies (English) and Sentences Submitted for Judgment. 

Strategy Sentence 

If you can give me some rice? 
May I have some rice? 
Could you give me some rice? 
Would you give me some rice? 
Do you have any rice? 
Will you give me some rice? 
Can you give me some rice? 
I came to ask where the rice is, 
You can give me some rice? 
You have some rice? 
Where is the rice? 
I want some rice? 
Give me some rice. 

You hav e to giv e me some rice. 

lative English speakers and Armenian speakers of English demonstrated that 
these same strategies accounted for 80 and 90 percent of the corpora, respec- 
tively. Currently, ethnographic and ethological work is being conducted to 
determine whether these strategies are the same ones which show the greatest 
frequency of use in naturalistic settings. 

The Puerto Rican subjects all judged 14 generic request strategies in 
Spanish which, like the English ones, were attached to a standardized lexical 
content (see Table 2). These strategies were extracted from a similar experi- 
ment to the one described above which was conducted in Spanish. The 14 
Spanish strategies accounted for 97 percent of a corpus of 256 requests. 

O 



6 If you can 

13 May 

7 Could 
9 Would 

11 Do 

1 Will 
10 Can 

4 I came to 

2 You can 

8 You have 

14 Where 

5 I want 

3 Imperative 

12 You have to 
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TABLE 2 

Generic Request Strategies (Spanish) and Sentences Submitted for Judgment. 



Strategy 



Sentence 



1 Usted no podria 
6 Me puede 

4 Puede ... roe 
14 Usted tiene 

11 Si me puede 
9 Quiere 

2 Puedo 

12 Usted no tendria 

5 Donde 

10 Usted no puede ... roe 

3 Me da 



Usted no podria danne el arroz? 

Me puede dar el arroz? 

Puede danne el arroz? 

Usted tiene arroz? 

Si me puede dar el ar<oz? 

Quiere danne el arroz? 

Puedo Hevar el arroz? 

Usted no tendria el arroz? 

Donde est&'el arroz? 

Usted no puede danne el arroz? 

Me da el arroz? 

Deme el arroz. 

Tiene que darme el arroz. 

Oue me de el arroz. 



8 ImperaMvo 
7 Tiene que 
13 Que 



Procedures. After reading and signing a consent form, each subject was 
provided with a set of instructions. The English instructions read as follows: 

The concept known as politeness as it applies to language use has been defined arf 
a speaker's symbolic subordination to the addressee. That is, the speaker intends to 
come across as having fewer rights and/or more obligations (i.e. subordinate) than 
the hearer, 

+ Each row on the next page contains two sentences which express a certain amount 
of politeness. Your task will be to decide which member of each pair is more polite. 
You shcuM ignore context as much as possible. Rely only on the words of the sen- 
tence to tnake your decision about its politeness. Some sentences will seem incom- 
plete. You *»re to treat these in the same way as the others, relying only on what is 
written to make your decision about the politeness expressed.* , 



The Puerto Rican group was administered a translated version of these same 
instructions. In addition, since the experiment was conducted in a group setting 
with the foreign students, the instructions were read aloud to them by the 
experimenter. The subjects from the other *wo groups participated in the ex- 
periment individually. 

Following Edwards (1957) the method of paired comparisons was imple- 
mented to arrive at a scale of deference for the generic request strategies. This 
method required that each strategy be compared' with every other strategy for 
the purposes of scaling. Fourteen strategies, when paired with every other one, 

gave a total of 91 pairs [ UiUJ*> ]. In the American English and foreign stu- 
dent groups each subject could be expected to judge only about 30 pairs due 
to time constraints. Therefore, the 60 native speakers of American English each 
judged about <t third of the 91 pairs, providing a total 20 judgments on each 
pair of sentences. The 75 non-native speakers, judging about a third of the. 
pairs, made the total of judgments for their group 25 for each pair of sentences. 



Example: 
Shut up 



♦ Pleas e be quiet 
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Since more time was available for the Puerto Rican speakers, each of the ten 
subjects made a decision on all of the 91 pairs of sentences. The results of this 
method provided a rank order of the strategies from least to most (Inferential 
and a measure of psychological distance between successive strategies. \ 

Results a*ul Discussion. In the ease of all three groups the raw data x (ie- 
the subjects' perception of politeness of the generic forms) provided a fre- 
quency matrix of the number of times each request strategy was judged more x 
polite than every other strategy. This 14 X 14 matrix of frequencies was con- 
# verted to a proportion matrix by dividing the cell entries of the frequency 
matrix by the number of judgments in each cell. Edwards' (1957) method then 
calls for a transformation of these proportions to a matrix of z scores. The z 
scores provide the normal deviate transformations for the proportions. Then, 
by summing and averaging the z scores associated with each request strategy 
a scale value of politeness is arrived at. This scaled score represents a quantity 
c c politeness attributable to a particular request strategy by a given group of 
subjects. 

The first hypothesis that native-speaking males and females would differ 
in their perception of the politeness of the fourteen request strategies was re- 
jected. In fact, a Pearson product-moment correlation between male and female 
scaled scores was .90, significant at the .001 level. An examination of the rank 
orders of the generib forms shows' the most disagreement on the relative 
politeness of will (strategy 1 in Table 1) and you have (strategy 8 in Table 1). 
In both cases the forms were perceived as more polite b\ Males than females. 
In. general, however, the agreement between male and female native speakers 
far outweighed the disagreement. 

The correlation which corroborates the overall agreement between males 
and females in the perception of politeness masks some interesting differences, 
however. The graphs in Figure one (for males) and two (for females) may 
help to clarify some of those differences. Despite the fact that the request 
strategies are ordered in roughly the same way on both graphs, there are 
obvious differences in the magnitude of the scaled scores (i.e. the height of 
the bars) and the distances between successive scores (i.e. the distances be* 
tween the bars). Males exhibit much smaller scores than females as well as 
a much narrower range from the most to least polite request strategy. These 
phenomena indicate two things: First, as a group, females display more unan- 
imity about politeness than males do. Second, the large increments on the fe- 
male scale, especially between strategies 11 (do) and 10 (can) and strategies 
2 (you can) and 5(1 want), indicate that they are more categorical in their 
perception of politeness than males. Either a form is extremely polite or ex- 
tremely impolite for a female speaker of English. Males, on the other hand, 
exhibit a less categorical perception of the fourteen strategies. 

The perception of politeness among non-native^ sneakers as compared with 
native speakers shows similarity to the congruence bet wen male and female 
native speakers. The correlation between native speaker and non-native speaker 
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perceptions was .89, again significant at the .001 level. Only two strategies (if 
*and can) were outstanding in their differential ranks by native and non-native 
groups. // you can give me some rice? while ranked first by native speakers was 
ranked sixth by non-natives. Can, a fairly neutral strategy for natives (ranked 
seventh) was perceived as relatively polite by non-natives (ranked fourth). 
Several possible explanations can be suggested for these differences. Perhaps 
non-native speakers judged the sentence If ... on the basis of grammaticality 
rather than politeness, causing it to achieve a lowej score than for native 
speakers. Alternatively, native speakers may have provided additional linguistic 
content for this form, choosing / wonder if . . . instead of If. . . . Can might be 
grouped with other modals by non-native speakers in a category of relative 
politeness due to the polite nature of modals in their respective first languages. 
Can, since it is used the most by native speakers, takes on the character of an 
all-purpose request strategy. This heavy use may be perceived as politeness 
by somewhat naive non-native speakers. 

The surface similarity between the native and non-native groups on the one 
hand, and male and female English speakers on the other, continues at a deeper 
level of examination. The graphs in Figures 3 and 4 depict the scale values 
for the 14 request strategies (in order from most to least polite) for native and 
non-native speakers. Comparing those graphs with the scaled scores for males 
in Figure 1 and femal* in Figure 2, we see the following: native speakers 
(Figure 3) look very much like males in both the magnitude of the scores 
(height) and the distance between successive forms. Non-native speakers, in 
contrast, exhibit much larger scale values attesting to the greater unanimity 
of their judgments. This phenomenon was especially true at the extremities of 
the politeness scale. This result implies that non-native speakers of English, 
like female native speakers, show a lot of uniformity in their perception of 
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Request Strategy 

Figure 1. Changes in an individual student's writing speed (in words per minute) over 
12 compositions. 



294 



The Learner in Focus 





Perception of Politeness 



295 



3 




13 



9 



10 



II 



4 



0 



14 



3 



Rtqut ft Strategy 



Figure 4. Graphic*display of scale values of politeness for 14 English request strategies 
judged by non-native speakers (n — 25). 

those strategies they judge to be extremely polite (may, could) and those they 
judge to be extremely impolite (you have to, imperative). 

It is much easier to speculate about the nature of this finding in non- 
native speakers than in female native speakers. The greater overall agreement 
among foreign students than native speakers indicates less differentiation in the 
politeness of request strategies. These second language learners did not demon* 
strate the same gradual increase in politeness from strategy to strategy that 
native speakers displayed. Rather, the request strategies tended to cluster more 
for this group, especially at^he impolite" end of the scale. Thus, the psychologi- 
cal distance between the fourth and fifth least polite forms (Where and you 
can) represent almost 25 percent of the entire range of the scale. 

One possible explanation for the clustering of the modals may, would, 
could, and can at the upper end of the non-native speakers' politeness scale is 
that these forms may have been taught in their English classes. The association 
of fo*ms taught in the classroom with a politeness range seems most plausible. 
The lack of differentiation in the perceptions of the non-native group may stem 
from incomplete knowledge of the force behind the request strategies. The 
lack of exposure to as many varied contexts as native speakers have been 
exposed to also contributes to the clustering Oi strategies at the extremities 
of the scale of politeness. 

In an effort to better understand the complex differences between native 
and non-native speakers' perception of politeness in English, we conducted a 
similar experiment in Spanish with Puerto Rican speakers. We expected the 
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Puerto 'Ricans to demonstrate the same high correlation between males and 
females that the other two groups display. The amount of contact between 
Puerto Ricans and North Americans, the similarity of request strategies in 
English and Spanish, and the amount of language contact all led to this ex- 
pectation. 

Although the range for the two scales is roughly the same (3.09 for males 
and 3.35 for females) the .correlation between the politeness scores is only 
.52 which is not significant. Tims, unlike the high correlations in the previous 
two groups, one cannot predict female perception of politeness from male per- 
ception or vice versa in the Puerto Ricari group. These sex differences are even 
more commanding when one considers the nature of the scales. Neither the 
male nor female scales exhibit any strategies with an extreme" scale value of 
politeness. This means that there is a lack of unanimity among the judges, 
the same phenomenon which occurred in the native English speakers* ratings 
but- which did not result in the non-native group s perception of polityiess in 
English. The fact that the scales exhibited the same range but produced dif- 
ferent results adds to the validity of the claim about sex differences in Puerto 
Ricans' perception of politeness, despite the small size of the sample. 

Reviewing the data, a number of interesting conclusions and several press- 
ing questions emerge. The data tell us the following: (1) Among native Eng- 
lish speakers, males and females perceive the same request strategies as polite 
and impolite. Females, however, reflect a greater degree of unanimity and 
greater distinctions between polite and impolite forms than males do; (2) 
Native and non-native speakers of English show a very high correlation in 
their perception of the politeness of request strategies. Non-native speakers 
exhibit the same phenomenon with respect to native speakers as females do as 
compared with males. That is, the non-native speakers tend to agree more 
with each other and make greater distinctions between polite and impolite 
forms; (3) ftierto Rican males and females show no significant correlation in 
their perception of the politeness contained in Spanish request strategies. 

We can conclude from this research that there may be profound cultural/ 
linguistic differences in the perception of politeness (as evidenced by the high 
male-female correlation in the native English group and the lack of such a 
correlation in the Puerto Rican group). Furthermore, ESL instruction or some- 
thing else in the second language acquisition process allows advanced non- 
native speakers to perceive the politeness of request strategies very mucn like 
native speakers. Their perceptions tend to reflect female native speakers, per- 
ceptions of politeness to a greater extent tnan they do male perceptions of 
politeness. Some of the questions arising from this work are: whether a de- 
velopmental sequence exists in the perception of politeness; vhether other lin- 
guistic groups will reflect the same sex differences that Puerto Ricans did in 
this study; and how the perception of politeness is mirrored in the production 
of these forms by adult second language learners. 
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Indirect Speech Acts in ESL: Indirect Answers 

Potricia L. Carrell 1 
Southern Illinois University 

Acquisition of competence in indirect speech acts — the saying of one thing to 
"mean that but also to mean something else" (Searle 1975:60)— represents an 
important aspect of an ESL learner's overall communicative competence. In- 
direct answers to questions are an interesting type of indirect speech act. 
Rather than answer questions directly, native <spi *kers commonly respond in- 
a directly. For example, a response to "Did you scru» the floor?" might be "Well, 
I swept it." Understanding such an indirect answer requires that the ESL 
* listener have 'he ability to recognize and correctly interpret the communica- 
tive intent of the speaker when that intent is not directly, formally conveyed. 

This paper reports an empirical study designed to test the competence of 
intermediate/advanced adult ESL learners to interpret the communicative in* 
tent of indirect answers. The performance of the ESL subjects is compared to 
that of a control group of adult native speakers. 

Results indicate a significant difference between the two groups in in- 
terpreting the Communicative intent of indirect answers. The ESL group did 
not perform as well as the native group. However, results also show a high 
level of ability among the ESL group, nonetheless. Implications of these find- 
ings for the preparation of communicative teaching materials are discussed. 

1. Introduction 

There has been^onsiderable emphasis recently within EFL/ESL on the 
goal of preparing seSond language learners who are communicatively com- 
petent. As evidence of this emphasis one need only look in the meeting hand- 
books of the 1978 >d 1979 TESOL conventions at the number of workshops ' 
and papers concerned with various approaches toward developing communica- 
tive or functional competence. Of particular prominence are the European 
(mainly Btitigi) approaches to development of functional/notional syllabuses 
and the related work in discourse analysis and English for specific purposes 
(Allen & Widdowson 1978, Candlin 1976). Taking a view of language as com- 
munication and a view of the understanding and producing of utterances as 
a pragmatic achievement, these approaches attempt to interrelate syntactic 
form and semantic interpretation \vi*h pragmatic use. They have a base in 
speech act theory— from the work of Austin (1962), Searle (#39) and others 
—and draw upon theoretical work in a wide range of other related areas, for 

1 The author wishes to thank SIU»C's Office of Research Development and Administra- 
tion, which provided financial assistance for this study, including support for a research as- 
sistant, CaMela Rfcltter, whose help is also acknovledged. Thanks are also due to those on 
campus who granted the author access to the subjects used in this study, including Prof. 
Richard Lanlgan of the Department of Speech Communication, and the Center for English 
as a Second Language (CESL). 
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example, studies in textual cohesion, in language function, in sociolinguistig 
variation, in presuppositional semantics, in ethnomethodoiogy, in the ethnogra- 
phy of speaking, and others. (See Candlin 1976: 238-239) These approaches 
have the goal of offering to the learner "the language he will need to participate 
as speaker-hearer in real acts of communication." (Silva 1975: 342) 

I concur that thk goal is what, we should strive for in L2 teaching and 
learning, and t am in complete agreement with Candlin when he says: ' • 

. . . the only proper goal of a language learning .syllabus (is) that of leading a 
learner to be able to communicate and understand in a foreign language not onty 
the meaning contained within grammatical structure, but also the range of meaning 
which lies outside the surface form, meaning as the communication of information 
* which is negotiated between speakers and hearers in the context of their talk and 

against a backdrop of their beliefs. (Candlin 1976:238) 

Since native English speakers are notorious for exploiting indirect speech acts, 
acquisition of competence in indirect speech acts represents an important aspect 
of an ESL learner's overall communicative competence. 

However, I am interested in logically prior questions; namely, what are 
the communicative or pragmatic abilities of ESL learners? What are the com- 
municative abilities of those who are learning ESL without the aid jof specifi- 
cally designed functional/notional syllabuses? In particular, how well do ESL 
learners interpret the communicative intent of indirect speech acts? The need 
for empirical studi^i to answer these basic questions is clear. Several recent 
papers by myself and others have attempted to give partial answers to these 
questions. One study (Flick & Carrell 1978) examined ESL learners' com- 
petence with indirect requests; in twc other studies I looked at their com- 
petence with presupposed and asserted information (Carrell 1977) and with 
given versus new information (Carrell 1978). The present study atterrpts to 
shed further light on the communicative/pragmatic abilities of ESL learners, 
namely on their abilities to interpret indirect answers to questions. 

Indirect answers represent an interesting area of such empirical pragmatic 
investigation. Rather than answer questions directly, native speakers commonly, 
answer indirectly. For example, an, answer to "Did you scrub the floor?" might 
be "Well, I swept it" 2 Such an indirect answer is appropriate, informative and , 
coherent. In order to understand the meaning of supb an indirect answer, the 
ESL listener must have the ability to recognize and correctly interpret the 
communicative intent of the speaker when that intent is not conveyed directly 
or formally. In order to communicate competently, ESL learners must, possess 
such abilities. The empirical question is, do they? And to what extent? This 
v paper reports the results oi a study designed to test the competence of inter- 
mediate and advanced adult ESL learners to correctly interpret the com- 
municative intent of indirect answers. 

As far as I know, as a category of indirect speech acts, indirect answers 

J This example came to my attention in a rocent Beetle Bailey cartoon strip, reproduced 
here with the permission of King Features Syndicate, 
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have not been studied in ESL, They have been the target of investigation in 
English as a first language by researchers in artificial intelligence and informa- 
tion retrieval svstems. In a recent paper entitled "Why Ask?" Jerry Hobbs (a 
• computer scientist) and Jane Robinson (a research linguist in artificial intel- 
ligence) analyze three categories of indirect answers with the aim of demon- 
strating that appropriate answers must address the higher communicative goals 
of the question and that various kinds of knowledge must be taken into account 
in analyzing the answers appropriateness. (Hobbs & Robinson 1978) 

2. Method' 

Subjects. The ESL subjects who participated in this study were 72 inter- 
mediate and advanced adult students at the Center for English as a Second 
Language at SIU-C. They came from levels 3, 4, and 5 of our five-level pro- 
gram. They were a heterogeneous group, but came predominantly from the 
Middle East, South America and tlie Far East. In the group there were 28 
native Farsi speakers, 17 native Spanish speakers, 10 native Arabic speakers, 
8 native Japanese speakers, and 9 speakers of other languages. CESL students 
placed in levels 3, 4, and 5 characteristically hav° Michigan and CELT scores 
averaging in the mid 50's, mid 60s arM mid 70s at each of those levels, re- 
spectively- The curriculum for levels 3 and 4 consists of six hours per day, dis- 
tributed equally among the basic skill areas: grammar, listening, speaking, 
, reading, writing and language laboratory. Level 5 students attend only 2 hours 
per day during which time all skill' areas are combined. The materials used at 
CESL are among th^ standards for such intensive centers in the U.S. 

A control group of native adult speakers of English was used to insure 
that the performance of the ESL group could be compared to real perfor- 
mance data of natives, not merely to idealized, theoretically predicted be- 
havior. The control group consisted of 29 SIU-C students enrolled in an under- 
graduate general studies course. 

Procedures. The questionnaire.. consisted of 27 short, written dialogues. 
Each dialogue consisted of 3 contextualizing or stage-setting statement describ- 
ing the conversational setting between two interlocutors. This was followed by 
one of the interlocutors asking a yes- no question. The question was then fol- 
lowed by an answer given by the second interlocutor, The subject's task ,T /as 
to draw a conclusion, in the form of a multiple choice response* about the 
- »aning trf the answer. In other words, the subject had to comprehend the 
ation conveyed by the answer and come to a conclusion based on that 

/nation. 

The design included not only indirect answers, but direct an^ unrelated/ 
uncooperative answers as well. A direct answer was defined as one wherein 
the conclusion one comes to based on the answer follows strictly logically from 
the answer, or, in other words, is logically entailed by the answer; no other 
conclusion is logically possible. An indirect answer was defined as one wherein 
the conclusion, whib not following logically from the answer, is a natural, 
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Mcely conclusion to come to. It follows by a process of natural conversational 
mferencinfc (Grice 1975) or natural bridging (Clark & Haviland 1977). An un- 
related answer was defined as one wherein the answer is not related to the 
question >n any natural or obvious way, thereby blocking any natural bridging 
or mferencing about the conclusion. The design was further equa'ly balanced 
among response choices, where affirmative, negative, or indeterminate con- 
clusions were the expected responses. In the case of unrelated answers, only 
indeterminate conclusions were possible. 

Figure 1. Research Design and Sample Items 
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Total 


9 


9 


9 





SOME SAMPLE ITEMS: 

1. b-l* Bob conies up to Ann m the Student Center. Bob says- 

last night? ' Ann says "I had to study last night." 

a. Ann went to the movies last night. 

b. Ann did not go to the movies last night. 

c. I have no ide# at all whether (a) or (b). 

2. a-a Sue and Lisa meet in the shopping center. Sue says: " 
-Lisa says: I bought one at Sears." 



"Did >o U go to the movies 



b-a 



Did you buy a new dress?" 

Jim and Henrv are talkmg abouV a party they went to the night before. Jim says- 
r 1? ;w whlske ^ Henry says: "All I drank was beer. 
7 t~* Cynthia says: " Did 

la^we^ Kenneysays. "Was Joe sick 

iasi wee* rete .says. 1 dicln t see him last week. 

7. t-c James and Harold aryksenssmj? Harolds grandmother. ,n Texas James savs- "Did 
C Zk •• " 8 ***** HamI<1 MyS: " , ri, « ,,Ved ' ^ f~ old 1 frieXf mme 

A table of random numbers was used to randomly order the 27 items on 
the questionnaire. 

'The instrument was administered to the subjects in their classroom groups. 
fnstruct,on S were given both orally and in writing and example items were 
illustrated. Subjects were told to. take as long as they required. In order to be 
certain that none of the vocabulary items posed any difficulty for the ESL sub- 
jects they were told that they could raise their hands and ask about the mean- 
mg of any words they were unsure of. 

Comparing the performance of the control group of native speakers- to 
that .deahzed, theoretically predicted behavior, i.e. to a perfect performance 
shows that the native group did not perform in the idealized manner. That is[ . 
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3. Results 

The descriptive a tical results are reported in table 1. 

TABLE 1 

Descriptive Statistics 





Direct 
Answers 


Indirect 
Answers 


Unrelated 
Answers 


Total 


ESL 
N=72 


M = 8.43 
SD = .802 
R = 6-9 


M = 6.15' 
SD = 1.507 
R = 3-9 


M=6.97 
SD = 2.123 

R = e-9 


M - 21.56 
SD = 3.058 
R = 11-27 


Native 
N=29 


M = 8.55 
SD^.854 
R =6-9 


M = 7.52 
SD = 1.163 
R = 5-9 


M =7.28 
SD - 2.347 
R =2-9 


M = 23.35 
SD = 3.272 
R = 14-27 


Perfect 


M = 9 


M =9 


M = 9 


M = 27 


Random 


M = 3 


M = 3 


M=3 


M = 9 



M (nun) 

SD (lUndftn* d*Ti»tion) 
R (range) 



there was a statistically sigiificant difference between the native speakers' 
mean scores and perfect scops on each of the three answer-types and on their 
total performance as well. See table 2. 



TABLE 2 

Native Group Mean Scores Compared to Perfect Scores 
One Sample t-tests 





Direct 
Answers 


Indirect 
Answers 


Unrelated 
Answers 


Total 


Native 
N =29 


8.55 


7.52 


7.28 


23.35 


Perfect 


9.00 


9.00 


9.00 


27.00 


t-value 


% 2.837 


6.852 


3.947 


6.015 


significance 


! " 

P<01 


P<01 


P<.01 


P < 01 



fnis finding shows that in order to be fair to the ESL group, one ought not 
to compare their behavior to the idealized, theoretically predicted behavior 
but to the actual behavior of the native fflkip 

On the other hand, comparing the performance of the control group to a 
totally random performance indicates that that group was not behaving totally 
randomly, m can be seen in table 3. In fact, the same may be said of the ESL 
grup as well; see table 4. Neither group behaved in a random fashion, indi- 
cating that they were each attending to the tasks, attempting to perform them 
O 
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TABLE 3 

Native Group Mean Scores Compared to Random Score* 
One Sample t-tests 





Direct 
- Answers 


Indirect 
Answers 


Unrelated 
Answers 


Total ' 


Native 
N = 29 


8.55 


7.52 


7,28 


23.35 


Random 


3.00 


3.00 


3.00 


9.00 


t- value 


34.994 


20.926 


9.821 


23.609 


significance 


p<.001 


p<001 


p< .001 


p < .001 


TABLE 4 

Native Group M*»an Scores Compared to Random Scores 
• One Sample t-tests 




Direct 
Answers 


Indirect 
Answers 


Unrelated 
Answers 


Total 


ESL 
N = 72 


8.43 


6.15 


6.97 


2156 


Random 


3.00 


3.00 


3.00 


9.00 


t-value 


57.460 


17.736 


15.867 


34.839 


.significance 


p<.001 


p<.001 


p<.001 


p<.001 



as well as possible, and performing them better than would have been expected 
if they'd been merely guessing randomly at the choices. 

Incidentally, Cronbach-Alpha indices of the reliability of the questionnaire 
for each of the two groups, if the 27 items are projected to tests of 100 items, 
were .86 for the ESL group and .92 for the native group. These figures indicate 
that, although the questionnaire was far from perfect, and by no means a 
normed, standardized instrunnmt, it was a fairly reliable measure. 

Pearson product moment correlations among the sub-tests and their re- 
spective significances are reported in the correlation matrix in table 5 by groups; 
the native control groups correlations are above the diagonal, the ESL group's 
correlations aie below the diagonal. For both groups, the only significant cor- 
relations are between direct answers and unrelated answers. For the native 
group r ^ .46, p = .006; for the ESL group r = .40, p = ,001. The correlations 
involving indirect answers, both with direct answers and with unrelated answers, 
are not significant. This may indicate that the inferencing or bridging required 
in interpreting indirect answers is a different ability, different in kind, from either 
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TABLF 5 

Sub-test Correlation Matrix by Croups 
Pearson Product Moment Correlations and Significances 
Native Group above diagonal, ESL Croup below diagonal 





Direct 


Indirect 


Unrelated 




Answers 


Answers 


Alls Wei 5 


Direct 




.20 


.46 


Arrwers 




p=.15 


p- 006 


Indirect 


-.02 




.16 


Answers 


p-.43 










p=.20 


Unrelated 


.40 


.10 




Answers 


p=.00i 


p=.2i 





interpreting direct answers or determining that an answer is unrelated to the 
question and thus coming to an indeterminate conclusion. 

Within the ESL group, although there were three Jnstructional-proficiency 
levels represented (30 subjects at level 3, 31 subjects at level 4, and 11 sub- 
jects at level 5), there were no statistically significant differences among the 
three levels in their performance on the three answer-types. 3 Nor were there 
any statistically significant differences among the five native language groups 
represented (Farsi, Spanish, Arabic, Japanese, and other). 4 Therefore, for the 
purposes o* this study, all ESL sul'ects were treated as members of a single 
group, differences in their English proficiency and native language being statis- 
tically irrelevant to the basic research question of this study. 

Thus, the basic comparisons of interest for the present study are those 
between the ESL group taken as a whole and the native control group. First, 
comparing the performances of the two groups on the three answer-types, we 
see that the performance of the ESL group was sjighfly lower overall than 
that of the control group of native speakers but was nonetheless quite high. 
Compare the native mean scores of 23.35 total, 8.55 direct answers, 7.52 in- 
direct answers, and T.28 un^ dated answers, to the ESL mean scores of 21.56 
total, 8.43 direct answers, 6.15 indirect answers, and 6.97 unrelated answers, 
respectively, in table 1. As previously shown, these scores for both groups are 
much better than chance. The sub-test scores for both groups can be compared, 
both univariately and multivariately. If viewed univariately, as a series of t-test 



1 Comparisons among the three proficiency .levels were made in two ways. First, viewed 
as a series of univariate (single dependent variable) comparisons, oneway analyses of variance 
were performed, comparing the three proficiency levels on each of the answer-types sepa- 
rately. However, since some of the sub-tests are modestly correlated, it was also decided to 
view the comparisons as a multivariate problem ( multiple dependent variables ), comparing 
the three proficiency levels on all three answer-types simultaneously. Viewed either way, 
there were no statistically significant differences among the three proficiency levels on the 
three answer-types. 

4 As above, the comparisons were made Joth univariately and multivariately. Viewed 
either way, there were no statistically significant differences among the five native language 
groups on the three answer-types. 
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comparisons for independent samples, the only statistically significant difference 
between the groups is in the area of the indirect answers. See table 6. 

TABLE 6 

Comparisons of Native Group and ESL Group 
on Three Answer-Types 
Univariate t-tests for Independent Samples 



# 


Direct 
Answers 


Indirect 
Answers 


Unrelated 
Answers 


Native 
N=29 


8.55 - 


7.52 


7.28 


ESL 
N=72 


8.43 


6.15 


6.97 


t-vahie 
dfciW 


.67 - 


4.36 


.63 


significance 


p> .50 


p< .0001 


P>^3 



If viewed multivariately, as a problem of distinguishing between the two 
groups on the three answer-types as multiple discriminating dependent varia- 
bles, the same result is obtained. The significance of the Wilk's Lambda in 
table- 7 indicates an overall difference between the two groups on the three 
answer-types. A discriminant analysis identifies the discriminant function — the 
linear combination of the dependent variables and their relative weights— 
which best discriminates between the two groups; the standardized discriminant 
function coefficients determine the relative contribution of each of the three 
dependent variables to the discriminant function. As can be seen in table 7, 
most, if not all, of the discrimination is due to one dependent variable — in- 
direct answers. The relative sizes of the coefficients indicate that the discrimina- 
tion between the two groups is not due to the interaction of the three de- 
pendent variables, but is primarily due to the one dependent variable — indirect 
answers. 

In other words; the ESL group did fairly well pn all parts of the interpretation 
task— ESL learners performed comparably, in fact, to the native group on direct 

TABLE 7 

% Comparison of Native Group and ESL Croup C 

on Three Answer-Types " 

Multivariate tests— Wilk's Lambda/Discriminant Analysis / * 



Discriminant 

Function 


Wilk's 

Eigenvalue Lambda Chi-Square DF 


Significance 


1 


.19473 .8370 17.347 3 
Standardized Discriminant Function Coefficients 
Direct Answers — 0.117 
Indirect Answers ^ 0,989 
Unrelated Answers + 0.021 


p < .001 • 
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answers and on unrelated answers — but they did differ significantly from the 
control group of native speakers in performance on interpreting the communica- 4 
tive intent of the indirect answers. 

4. Discussion 

The results yielded basically two interesting findings. First, there was a 
significant difference in the performance on interpreting the communicative in- 
tent of indirect answers between the ESL group and the native group. The 
ESL group did not perform as well as the native grot^g^hey do not possess 
native-like competence in this area. Second, however, the ESL did quite well 
on interpreting all three answer-types, including interpreting the commutficative 
intent of the indirect answers — far better than a chance performance. In other 
words, while the ESL subjects are not comparable to native speakers in this 
area of communicative competence, they exhibit a "high lever "of ability none- 
theless. 

The item which gave the ESL subjects the greatest difficulty were two' 
indirect .answers in which the answer was itself a question — a question with 
a supposed obvious answer— an answer like "Is the Pope Catholic?" According 
to Jerry Morgan (1978), the schema for drawing an implicature in these cases 
is: Answer an obvious yes/no question by replying with Another question whoke 
answer is very obvious and the same as the answer you intend to convey." 
(Morgan 1978: 278) According to Morgan, most Americans have two or three 
stock expressions like this, but new ones may be manufactured as they are 
needed. In this study, one of these obvious ans^ /ers was an obvious affirmative, 
the other an obvious negative: 

Affirmative: Q: Is Becky rich? 

A: Do birds fly? 
Negative: Q: Is Greg smart? 

A: Is Sadat Jewish? 

Somehow the obviousness of the answers eluded many of the ESL subjects. 
Not surprisingly, on several grounds — both psycholinguistic and other, the 
negative proved even more difficult than the affirmative. While only 58% in- 
terpreted the affirmative item correctly, only 4QS> interpreted the negative item 
correctly (despite its timely relevance in the fall of 1978 when the test was 
administered). Interestingly, the control group also had its worst performance 
on interpreting the negative item— only 62* got it correct. (93% got the affirma- 
tive item correct.) 

The generally fine performance by the ESL subjects on interpreting indirect 
answers indicates that they already have a high level of proficiency in com- 
prehending these kinds of indirect speech acts. We might speculate that a large 
measure of this is carried over from first language acquisition or may, perhaps, 
reflect universal linguistic or communication properties. Although all of the 
research that I am aware of in the area of indirect speech acts has been done 
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on English, it seems reasonable to suppose that all languages have mechanisms 
for performing indirect speech acts. Searle (1975) argues: 

... the mechanisms by which indirect speech acts are meant and understood are 
perfectly general— having to do with the theory of speech acts, the principles of 
cooperative conversation, and shared background information . . . (Searle 1975:710 

While there are obviously going to be differences among languages in the spe- 
cific indirect speech forms used to perform different indirect speech acts, (e.g. 
*Can you hand me that book?" may function as an indirect request in English, 
but its Czech translation will sound odd if uttered as a request in Czech— 
Searle 1975: 76), we may speculate that classes of indirect speech acts exist 
universally— e.g. indirect requests and indirect answers. Thus, foreign students 
approaching the learning of English as a second language may, if only subcon- 
sciously, expect to encounter indirect speech act situations, communicative situa- 
tions wherein, as Searle (1975) says: 

^ the speaker coinmunicates to the hearer more than he actually says by way of 
relying on their mutually shared background information, both linguistic and non- 
linguistic, together with the general powers of rationality and inference on the part 
of die hearer. (Searle 1975: 60-61) 

5* Conclusions 

The implications of these findings for the current emphasis in EFL/ESL 
in preparing functional/notional syllabuses, the so-called communicative ap- 
proaches mentioned at the outset appear clear. If we are to construct the most 
effective of these syllabuses, utilizing and building on the underlying commu- 
nicative abilities which are already present and not needlessly re-teaching 
something already acquired, we must understand more about these underlying 
communicative competencies. In other words, we need more studies like this 
one to get at existing communicative competence. We must understand more 
about the extent to which these abilities are present in order to establish the 
base-line data upon which to build. This study shows that in the area of in- 
terpreting indirect answers, there is much there to be built upon as syllabuses 
are written to aid ESL students to achieve native-like communicative com- 
petence. 
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Questions in Task-Oriented Discourse 

Ruth Ciymts and William Potter 

University of Hawaii 

An analysis of the substantive questions in four task-oriented discussions in 
English, each of which addressed the same topics but had different participants, 
revealed the* following question functions: leading questions, open questions, 
challenges, softened assertions, and clarification questions. These question 
functions, except for clarification, are related to the nature and structure of 
task-oriented discourse, but tfj form of the questions in relation to tonction 
and the distribution of ths qucitions in different parts of the discoursettire re- 
lated to the interactional patterns of each particular group of discussants. The 
nature and structure of task-oriented discourse is described and the interac- 
tional patterns of the four different groups of discussants are analyzed and 
compared. 

We have investigated samples of -one kind of oral discourse, the task- 
oriented discussion, -focussing on the questions that were asked in order to see 
what connections we might find between the questions on the one hand and 
the structure of the discourse and the roles of the participants on the other. 
We looked at both question functions and question forms. 

We have analyzed four discussions in English, two by native speakers, One 
by non-native speakers, and one by a mixed group of native and non-native 
speakers. We found that the three non-mixed groups— that is, the groups that, 
were either all native or all non-native speakers — each executed a similar dis- 
course plan but handled its elements and exploited the resources of the lan- 
guage in different though systematically related ways. Jach of these groups 
succeeded as well as the others in communicating withm the group. In con- 
trast, the mixed group of native and non-native speakers had considerable dif- 
ficulty in carrying out the discussion and communicating with each other. 

We will first describe our procedure for collecting the data: Then, in pre- 
senting the data and our analysis of it, we will discuss (1) the structure of 
task-oriented discussions, (2) the roles of the participants, (3) the functions 
of the questions which appeared in the discussions as they relate to the struc- 
ture of the discourse and to the roles of the participants, and (4) the forms 
of the questions as related to their functions. Finally, wc will compare the four 
discussions in terms of these four parameters and conclude with some implica- 
tions for teaching. 

1, Data Collection 

The four task-oriented discussions which we audio-taped and transcribed, 
following the system of Sacks, Schegloff, and Jefferson (1974), were on ESL 
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topics. The discussants were ESL professionals, the non-native speakers among 
them all having a. high degree of proficiency in English. 

The four discussion groups had the following compositions: (1) four native 
speakers of English, Americans in mid-career— three men and one woman; (2) 
four native speakers of English, Americans at the beginning of their careers, 
all candidates for the MA degree in ESL— two men and two women; (3) four 
non-native speakers of English in mid-career from Asian and Pacific countries 
—two men and two women from Burma, Fiji, Korea, and Thailand; and (4) 
a mixed group of native and non-native speakers of English in mid-career— 
, two men and two women from Japan, New Zealand, the Philippines, and the 
United States. The discussions were 12, 26, 28, and 30 minutes long, respec- 
tively. 

The procedure was to elicit a discussion by first giving a written question- 
naire to each of the participants which was patterned after a values clarification 
exercise (Simon, Howe, and Kirclienbaum 1972: 252-254), asking them to 
strongly agree, somewhat agree, sometfrlill disagree, or strongly disagree with 
the following five teaching practices: Teachers should (1) expect students to 
read aloud with good pronunciation; (2) emphasize vocabulary acquisition 
from the beginning; (3) make a lot of use of dialogues; (4) give grammatical 
explanations to their student:,, and (5) teach simple structures before complex 
ones. The participants were then asked to compare their answers and in each 
case where there was not unanimity they were to discuss the statement. As it 
happened, there was no unanimity in any group on any statement. 

Then the task was for the participants to come to a consensus on each of 
the issues raised by the statements, revising the statements as necessary. They 
were instructed to do this orally. We told them that we would listen to the 
tape later and summarize their ideas. We did not tell them that our purpose 
was to analyze questions in the discourse; we told them instead that we wanted 
to compare the ideas of the different groups on ESL pedagogical practices. 
Topics from any area could have been used, but we had access to ESL pro- 
fessionals, and so we solicited their cooperation. 

2. Structure of the task-oriented discussion 

A task-oriented discussion is one in which participants have not only an 
ideational focus for the content of their utterances but also a charge to try to 
reach some community of thought about that focus which has the potential 
of being coherently summarized. To reach a community of thought does riot 
require agreement; it requires only an effort to understand the whole picture 
as it emerges in the discussion, an effort which might then set the stage for 
pbints of agreement or for the decision to agree to disagree or simply for a 
summary. A task-oriented discussion, in its overall function, may be an oral, 
experiential equivalent of the review of the literature in a research report, 
Participants are set, or set themselves, to the task of determining and/or char- 
acterizing the views of their group on some topic or issue. 
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Preferably there would be more than two participants in such a discussion 
since the goal is to ascertain and benefit from collective knowledge and experi- 
ence, but it would be possible to have two participants only. The upper limits 
are flexible within the contraints that space and time impose on oral interaction, 
but the discussion would shift from informal to formal as the numbers got 
larger. 

The most ritualized form of the task-oriented discussion is the panel dis- 
cussion. Less rihialized forms occur, for example, in staff and organization meet- 
ings, in classrooms or conferences where small groups are collectively address- 
ing an issue, and in committees charged with preparing position papers or 
framing resolutions. 

In looking for the structure of the informal task-oriented discussions that 
we taped, we referred to the ritualized form, the panel discussion, and noted 
that it has a beginning, a middle, and an end. At the beginning is the presenta- 
tion of each panelist's view of a set topic; in the middle, opportunity for open 
discussion (\Vnich may or may not be seized); and at the end summation by 
a moderator, who has also presided throughout. 

This structure is also visible in informal task-oriented discussions. There is 
a presider and moderator, roles which we will describe below. And each topic 
is discussed in a sequence of three parts which we have labeled the presenta- 
tion, the exploration, and the accommodation. 

During the presentation each participant explains his or her position rela- 
tive to the topic. The presentation ends when all participants have done so. 
During the exploration participants expand on what they presented earlier, 
respond to the ideas of others, and seek clarification as they feel necessary. 
When they have said what they have to say within their allotted time, they try 
to come to some kind of accommodation. They may take a common position, 
they may explain why they cannot take a common position, or they may just 
■give up and stop discussion of that topic. Or they may, during their efforts at 
accommodation, especially if they feel that someone is pushing accommodation 
prematurely, re-open the topic for further exploration and then later move into 
the accommodation stage again. 
3. The roles of the participants 

We found that in the discussions the participants could assume three dif- 
ferent roles. All assumed the role of discussant by contributing to the sub- ¥ 
stance of the discussion. In addition, one or morfc than one self-selected the 
roles of traffic controller and middleman. 

The traffic'controller presided, iflanaging procedural matters. He or she got 
the discussion going and tried to get it and keep it on track and conclude it. 
One participant might retain this role throughout a discussion, participants 
might take turns, or they might even at some times assume the role jointly. In 
our data traffic controllers said such things as the following: 

(1) Non-native speakers: O: OK then. We'll go down the list then. It's 
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aboutV/O: Uh, OK. Carry on with your, reasons.//0: Yeh, so the 
t point is, shall we se- settle on A[gree] S[omewhat] or shall we settle 
on Disagree] S[omewhat] (sigh). S[trongly] A[gr*ee] and S[trongly] 
D[isagree] are out of the question now. 

(2) Native speakers/mid-career: C. Number one. I'm uhs, uh, strong 
disagreer.//C: How about you?//C: Shall we, can we go to, com- 
promise on disagree somewhat?//C; You could live with that? 

(3) Native speakers/beginning career: B: Uh (2 sec) for4he- questions 

* first of all how did you interpret them?, For what level? Kay did you 
lump them all together under one Itevel er, when you were, thinking 
of these items?//T: So I don't know, if<lle*re that many, = B:°That 
we can make— that you could— T. == general statements that we can 
make. B: Maybe we can put in qualifiers.// M. What about four 
"Give grammatical explanations" = B. M to the students." T: What 
did you— 

The middleman mediated, managing substantive matters. This role, too, 
might be assumed by one person throughout, who might or might not also be . 
the traffice controlIer%r the role might shift from one participant to another. 
The middleman tried to help the others formulate their ideas, clarify what others 
had said, and make generalizations based on what others, had said. For ex- 
ample, note the utterances of G (native speaker/mid-career) and O (non-native 
speaker) in the following excerpts. 

Topic: Teachers should emphasize vocabulary acquisition from the begin- 

• ning. 

W. I said "Disagree Somewhat." Uh (2 sec) Mm. (5 sec) Well, I 
don't know, I, hm, I feel that tuh they'll learn, what they need 
to learn. Now- "'rom the beginning' I don't know that's the hard 
part but, (3 sec) 

(4) G. (Well) would you put it in their way? Would you put, new vo- 

cabulary in their way, so they'd have to learn? (2 sec) 
W. No I wouldn't. 

(5) G. You wouldn't, deliberately (do anything about it)? 

W. No, of course I'm envisioning the idea of— the ideal classroom, 
where the students interact and learn English, they'll learn, that 
vocabulary right? (Laughs) 

Topic: Teachers should have students read aloud with good pronunciation, 
(near end of discussion; 

(6) G. Yeh. Well it seems two things are mixed up. You know, if you're 

teaching uh pronunciation, you're uh probably not at the point 
where you're going to have them read aloud anyway. It seems 
as though the two things are, out of focus. 




312 



The Learner in Focus 



Topic: Teachers should give grammatical explanations to their students. 

(7) C, What would be your opinion, of giving grammatical explanations 
at the, initial stages [of teaching?} 
L. L N °1 not at a11 



I don't 

(8) O. 



think that.] 

You would] strongly disagree then? 
L. No, I would strongly disagree. 
(9) O. So you're you're speaking in terms of the level of your [L. Right.] 
learners. Yeh? 
L. I guess all my answers here [because I have= 

( 10) O. L Yes > of course ( ) 

yeh related to your-] 
L. = decided to- base my answers on-] 
O then turns to Q and asks her to give her reasons. 

A person asking a leading question, which O's and G's questions t 4, 5, 7) 
above exemplify and which we will discuss below, was by definition assuming 
the role of middleman. A middleman also makes mediating and summarizing 
statements as. in (6), or even statements for another person as in (8, 9, 10). 

In asking questions, the participant in th° role of middleman may some- 
times be trying to get a particular answer, especially in the accommodation 
\ stage, since, unlike the moderator in a panel discussion, who is only a middle- 
\man, the middleman in these informal task-oriented discussions also wears the 
hat of discussant, as illustrated below in (11, 12). 

Topic: Teachers should teach simple structures before complex ones. 

G. Suppose you take a (task) of learning for instance. 
C. Yeh. OK, 

(11) G. Do you teach the simple before you teach the complex? 
C. Yeh! (3 sec) 

C. (To W) Then you gotta move then. 

(12) G. Is it any different with language? (4 sec) 
W. °Maybe. 

R. Might be, yeh. Might be that you teach the complex and then 
you find out what the simple things are that make up the com* 
plex. I don't know. (4 sec) 

G. Well fd be willin to move. 

W. (Laughs) 

G. I'm not sure — uh what I meanhhhh. 

(R and G laugh. End of discussion on this topic.) 

G has tried to lead others to his position. But he ends up uncertain about 
what his own position is. 
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4. Question functions 

The questions thatwe examined were those which addressed jbstantive 
matters relating to the topic under discussion. We did not exarrn.. ictorical 
questions, questions about rjfocedural matters, or questions not rr'jvant to the 
topic, in identifying which utterances were questions we used as criteria both 
the various question forms and the expectation of a response. 

We identified five question functions a*s follows: open questions, leading 
questions, challenges, softened assertions, and questions of clarification, this last 
of two types — one asking for confirmation of the acoustical signal and the other 
of meaning. 

We befieve that these five categories of functions are exhaustive for our 
data, but we. recognize our, whole framework as preliminary and we call atten- * 
tion to the ultimate subjectivity of ou»- interpretations of speaker intentions. Fol- 
lowing are discussion and illustration of each of these question functions. 

Open questions^seek information. They are for the benefit of the questioner. - 
In the explorati' a stage of the native speaker/mid-career discussion of Teach- 
ers should have students read aloud with good pronunciation," W and C make 
it clear from their responses to G and R's reasons that they don't think that 
good pronunciation has any relevance to reading. But the four discussants 
don't seem to understand that W and C are associating the reference to "read- 
ing aloud" with the reading process and G and R are associating it with oral 
production, G asks an open question of W and C. 

(13) G. Well if you two had them read aloud, in the first place, what 

would you expect? 
C. Oh f d try for uh, well I don't know if I would uh, I would have 
em read aloud. 

W. °Yeh, but uh, you w-ildnt expect them to, [C. °Yeh.] read 
aloud. 

H has tried to seek information, but G and W have answered evasively because 
tiie* question is meaningless to them. They are operating on a different assump- 
tion from G about the purpose of reading aloud. 

Jn the native speaker/beginning career discussion of "Teachers should teach 
simple structures before complex ones" we finJNhe following open question in 
the exploration of what complex means. 

(14) B. . . . What else would be complex? 

» T. Well of course it s- all very relati. to what you're teaching, I 
mean some of the things we're talking about, I suppose might 
be considered yaknow simple, to one class and very difficult to 
another, even, even tenses, something like past perfect, and 
present perfect, some people ytfknow I've had .students who have 
real trouble keeping the, the time structure right. You never 
know. 

B. Yeh I would think those are more complex. 
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Leading questions help another formulate or clarify his or her ideas or try 
to lead the group to reach points of agreement. Whereas open questions are for 
the benefit of the questioner, lepuing questions are for the benefit of the respon- 
dent In asking a leading question, a participant is assuming the role of middle- 
man, as illustrated in (4, 5, 7) above, which occurred in the presentation stage, 
and in (11, 12) above, which occurred in the accommodation stage. 

Challenges call into question * • reasonableness of someone's assumption 
as the questioner perceives that assumption. In the native speaker /mid-career 
^(discussion of "Teachers should have students read aloud with good pronuncia- 
tion" the excerpt following (13) above contains three challenges addressed by 
R to W as follows: 

W. °Yeh, but uh. you wouldn't expect them to, [C. °Yeh.] read 
aloud. 

(15) R. Never? Can't think of any circumstances under which you'd have, 

people read aloud? 
V/. Oh maybe uh, you know "What what sentence you got there" 
f- for [teaching writing.] 

(16) R. [How about poetry?] 
W. Cli no. I wouldn't uh, 

(17) R. Poetry shouldn't be, said aloud? read aloud? 
C Performed aloud] 

W. [Mm.] The idea of good pronunciation is the part, partieularly- 
j (Laughs) (You'd say uh) correcting their pronunciation all the 

time. 

R. I didn't say anything about correction. . . . 

A challenge often provides more fnsight into the ideas of the questioner 
than the answer provides into the ideas of the respondent. Considerable ex- 
ploration is often required to determine if challenger and respondent are talk- 
ing out of the same assumptions or not, and they may never identify their source 
of difficulty. 

The clearest example of such difficulty occurred in the group of two native 
and two non-native speakers. The topic was whether or not teachers should 
have their students read aloud with good pronunciation. All disagreed except Y 
(from Japan), who agreed somewhat. The discussion got shifted from exploring 
the relation between reading and pronunciation to the topic of what the stan- 
dard of pronunciation should be. The stage was set for this shift when Y said 
" that the situation in Japan was special, that the learners there needed to read 
aloud because the only way that they could practice spe'aking English was to 
have their teachers, who were not good English speakers themselves, "manipu- 
late good materials," as he said, for choral reading for the purpose of providing 
a model for the students. What he apparently meant was that in Japan the 
teacher would play tapes, the learners would listen and then later read out 
loud what they had listened to and the teacher would evaluate the students 



Task-Oriented Discourse 



315 



pronunciation. His assumption, as we interpret the total discussion, was that a 
teacher does not have to have good pronunciation to help the students acquire 
good pronunciation. , 

After A (from the United States) first ascertained that Y was supporting 
reading aloud as a pronunciation exercise, she then asked him an open ques- 
tion, "And the, who is the model for that?** When he answered that recordings 
were the model, A came up with three challenges in the 'ensuing discussion: 

A. But if, if you're using the reading as a pronunciation exercise, 
for your students in Japan and you said that the teacher, uh, 
many times didn't have very good pronunciation 

( 18) [T. No:.] then who is the model, who do the tea- do the students, 
model their pronunciation, after so that they get the good pro- 
nunciation, when they read?, 

7 Y. Ah; ; They have the record, they have the tapes and the : : they 
have lots of materials for example TV program and radio pro- 
gram as well. • ^ 
Z. Are you suggesting that there wouldn't be any transfer? 
A. Well I, I think if you were going to ttse it in this way, I'm I 
just- I do- I mean that, I wouldn't use it that wayhhh. But if you 
we;re= 

Y. Since we are not native [speakers,] we have to us< , 
A. s [Yeh] 

A. Mmhmn. But what my poi*, my point is, if you're going to- use 
it as ° pronunciation exercise, 

( 19) how do they know whether they're pronouncing it, uh like 
[uh,]= 

Y. [You] mean the- 

A. = the teacher wants it ^pronounced? If and especially if the 
teacher, i- if he, as you said, if the teacher doesn't have good 

(20) pronunciation, then how can they improve good pronunciation, 
by reading, out loud? 

Z. Do you think that this is an important facet F? This, pronuncia- 
tion?. . . . 

Apparently A concluded that there was no reasonable answer to her ques- 
tion, but she kept asking it in order %et her own point across, which, however, 
she failed to do. Y stuck to his ide at Japanese English teachers have to use 
recordings in a reading aloud lesson since they are not native speakers. Z*s lead- 
ing question to A, "Arf you suggesting that there wouldn't be any transfer?" 
shows that Z has some otion of Y's assumption. Z thinks that the whole dis- 
cussion is beside the point but evidently does not \yant to offend Y, and so he 
addresses a challenge, at the end of the above exchange, not to Y but to F, the 
other non-native speaker, from whom he expects and receives agreement: "Do 
you think this is an important facet F?" F agrees that it is not. Z is trying to 
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get the group back to the original topic, hut he does not succeed. They shift to 
talk about what the standard of pronunciation should be, Japanese English or 
native speaker English, and they never get back to the original issue. 

Softened assertions state ones belief or understanding and at the same time 
seek confirmation of it from others. In the native speaker/m^career discussion 
of teaching vocabulary from the beginning, - has tried to 1^ w t0 malce her 
position clearer (see examples 4 and 5 above). He has asked: "You wouldn't 
deliberately (do anything about it [vocabulary] ?" W\s answer concludes with 
a softened assertion directed to the whole group; 

W. No, of course I'm" en-envisioning the idea of- the ideal class- 
room, where the students interact and learn English, and what- 

(21) ever they needs they have, they'll learn, that vocabulary right? 
( Laughs ) 

C. Wrong. 
(All laugh) 

When asked by a middleman, a softened assertion states the middleman's 
understanding of the group s position, as in the following excerpt from the native 
speaker/beginning career discussion of teaching vocabulary. 

B. In other words these restricted books in the beginning where 
they have* a limited vocabulary like if you have a beginning 
class, [M. Mmhmn.] uh. there several series that have very 
limited [vocabularies.] 
T. Seven hundred 

(N) 

T. =hundred words, 
B. Yeh. 

M. Hnh. ♦ 

(22) B. And' that ch- you-think that right from the beginning there 

should be a beefing up of vocabulary? 
M, Attention drawn to it. Uh I don't think you have to use con- 
trolled materials. 
B. Yeh, OK. 

In our data questions of clarification occurred only in the mixed group dis- 
cussion. This is probably partly happenstance and partly a consequence of the 
fact that when people assume communication is proceeding, as in the non-mixed 
discussions, they don't feel the need for asking questions of clarification so much. 

At one point in the mixed group discussion, A needs to confirm what was 
said because of overlapping talk. She has asked Y whether he would use read- 
ing aloud as a reading exercise or as a pronunciation exercise. 
Y. Mm. Yes. Choral reading- 
A. As- 

Y. [Group reading.] 
F. I Pronunciation.] 
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(23) A. As pronunci 



ation?] 

Pronunciation, yes. 



It was common in this discussion for F (non-native speaker) to answer for Y 
(nongative speaker) 

In the discussion of whether teachers should teach simple structures before 
complex ones, F presents her argument against structurally graded materials 
and Y asks a question to confirm his understanding of what kind of materials 
she is referring to in (24) below: 

F. Well. Uh, if we uh (2 sec) in in structural linguistics, uh the 
audio-lingual approach, we are told or taught, that it's uh, °you 
know, it's pedagogically effective if we start with simple struc- 
tures before we move on to complex ones, but uh, actually there 
is no, uh psychological reality to this kind of theory. . . . Now, 
uh, I think if you structure your syllabus in this manner, you are 
uh restricting as I said, the progress of the student and you 
come up with, with eh, lessons probably that is not eh, that is 
uh, that doesn't reflect reality because in reality students actually 
make use of a lot of embedded structures and this type of thing 
so- I don't think there is psychological reality to, uh to this uh 
kind of belief that y- you, yaknow, you succeed uh better if you 
start with the simple structures before you move on to, the more 
complex ones, 
y (N) Mm. 
r Y. I see. 

Z. That's quite true, you know, that's borne out in our country as 
far as the teaching of reading is concerned. 
(24) Y. Are you referring to some kind of the er automatical uh auto- 
matic, the substitution drill? or the parroting ths 
uh changing] from one to another? 

F, [Yes that's-]= 

F. =:Yes, tha- that's based on this. 

5. The form of the questions 

Elsewhere we have reported on the forms of the questions asked in the 
three non-mixed discussions (Crymes and Potter 1979). We noted that in 
the question forms there was a cline of assertiveness from most assertive to 
least assertive as follows: Affirmative Statement with Question Tag-* Polar 
Subject Verb-* Polar Subject Verb Elliptical -* Polar Verb Subject -+WH. 
For the question functions described in the preceding section, the unmarked 
forms, excluding questions of clarification, were as follows: Affirmative State- 
ment with Question Tag for softened assertions, Polar Subject Verb for strong 
challenges, Polar Subject Verb elliptical for challenges, Polar Verb Subject for 
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leading, and WH for open Questions. But not only were there stronger and less 
strong challenges, there were also marked forms which were less assertive for 
the other functions, as follows i Polar Subject Verb instead of Affirmative State- 
ment with Question Tag for softened assertions; Polar Verb Subject instead of 
Polar Verb Subject Elliptical for even weaker challenges; and WH instead of 
Polar Verb Subject for leading questions. Note, for example, (7) above, where 
the WH question is a leading question. It is answered with a M ndftjJ| if >* w * re 
a Polar Verb Subject question. We called "these marked forms drwnsliifting, 
meaning that there was a shift from more to a less assertive form. 

In the mixed group discussion we found examples of double downshifting. 
These were WH questions which were not interpretable as open questions. 
They were challenges. Evidently in these cases the questioner was challenging 
not what someone had said but the existence of an answer. That is, the ques- 
tioner evidently felt that no reasonable answer was possible. But not wanting 
tc come on with too strong a challenge, the questioner downshifted two steps 
to the WH form. Examples are (18, 19, 20) above. 

As we have indicated, it was only in the mixed group discussion that we 
found questions of clarification. They were, all polar questions of various forms. 
They were not challenges, because it was "not clear what, if anything, could he 
challenged, nor leading questions, because they referred back to what had 
already been said. The questions of clarification in our data appear to be side* 
sequences (see Jefferson in Coulthard 1977^.73-74). 

See Chart 1 for a summary of the correlations between question functions 

and question forms. TABLE 1 

Question Form/Functidn Correlations 
(Broken line = downshifting) 



Functions 



Form 



Function 



Softened 
assertion 

Strong 
challenge 

Challenge 

Leading 

Open 



* Affirmative tag 

-t, — »Pol*r SV 

»Polar SV? elliptical-* 



Polar VS ^ 



Clarification 



-+WH 



6. Summary and conclusions 

Chart 2 provides characterizations of the four discussions. 

The native speaker/beginning career group was the least managed and 
often built up communal utterances for both procedural and substantive com- 
munications, They asked more questions than the non-native speaker/mid- 
9 , - 
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career group but fewer than the native speaker/mid-career group. Half their 
questions were open and a lit lie under a third were challenges. Almost half 
were downshifted and almost all occurred during exploration stages. 

The native speaker/m id-career discussion was quite firmly managed though 
there was some sharing of authority. The participants asked the most questions 
even though they spent much the shortest time in discussion, and all of them 
were about equally active as questioners. They asked almost half leading ques- 
tions and a third challenges and distributed their questions about equally over 
rhe three stages. There was no downshifting. 

The non-native/mid -career discussion was the most firmly managed, both 
procedurally and substantively, and the fewest questions were aaked by the 
fewest participants. Half the questions were leading and over a third were 
challenges. Most occurred during the presentation stage, and somewhat more 
than half were ddwnshifted. 

The non-mixed groups thus all differed from each other in quantity and 
quality of management, number of questions and questioners, and distribution 
of the questions in the different parts of the discourse. But the native-speaker/ 
beginning career and non-native speaker/mid-career speakers shared a propen- . 
sity for downshifting, and the mid-career speakers, both native and non-native, 
favored leading questions and challenges whereas the native speaker/beginning 
career speakers favored open questions. Age, professional status, and cultural 
background were all operating in various combinations' as influences. 

In contrast, the mixed group had difficulties moving Jong in their dis- 
cussions, both procedurally and substantively. There was a struggle for the role 
of traffic controller. The role of middleman was seldom assumed. The partici- 
pants asked the most questions of any group, but all but one of the questions 
were asked by thfe two native speakers. Over half were questions of clarifica- 
tion and almost a third were challenges, most of the latter doubly ddwnshifted. 
Further, it was only in the mixed group discussions that one person answered 
for another: one of the non-native speakers quite often answered for the other 
non-native speaker. 

At least one or the other of the two of us investigating this data knew per- 
sonally each of the discussants, and we recognize that some of the interaction 
that we have noted stemmed from individual personalities. Yet the three non- 
mixed discussions were different in degree rather than in kind from each other, 
and the mixed group was different in kind rather than degree from the other 
three. 

Excluding the questions of clarification, the functions of the questions ap- 
peared to be primarily determined by -the nature and structure of the discourse, 
and their form and distribution appeared to be primarily determined by the 
interactional pattern of the group. Where there was not a common interactional 
pattern, as there was not in the mixed group, there may not in fact have been 
any more lack of mutual understanding of the substance of the discussion than 

ERJC 321 



320 



The Learner in Focus 



in the other groups, but there was more unease and frustration. The number 
of clarification questions asked were in part a manifestation of this frustration. 

TABLE 2 
Characterizations of Four Discussions 



NS/BC = Native speakers/beginning career NN/MC == Non-native speakers/mid-career 
NS/MC = Native speakers/ mid-career MG = Mixed group/mid-career 







Rules of questioners\_ 




, r Traffic, Controller (TC) 






Initially 


Thereafter 


"^MlHojleman 


NS/BC 


Highest status 
(seniority) ? 


Shared by all 


Assumed by x- all as a communal 
persona 


NS/MC 


Highest status 
(rank) 


Shared by two others but 
highest status predominates 


Primarily the one who is never 
TC, minimal participation by 
chief TC 


NN/MC 


Oldest 


Oldest 


Oldest 


MC/MC 


Oldest (NN) 


Oldest keeps trying but 
usually superseded by 
other NN 


None 





No. of 

qqes- 

tions 


No. of 
ques- 
tioners 


Question functions 
( ) = downshifting* 


Distribution of 
ques in discourse 






Open 


Lead- 
ing 


Chal- 
lenge 


Sof. 
asser. 


Clar. 


Pres. 


Expl. 


Acc. 


ll 


NS/BC 


10 


Mainly 
2 


5 


1 


3(3) 


KD 


0 


1 


8 


1 


26 


NS/MC 


12 


4 


1 


5 


5 


1 


0 


4 


5 


3 


12 


NN/MC 


8 


2 


0 


4(3) 


3(2) 


1 


0 


5 


2 


1 


28 


MC/MC 


2? 


Mainly 
2 


. 3 


1 


7(7) 


2 


9 


0 


22 


0 


30 



7. Implications for teaching* 

Given the variability of question functions and forms and the distribution 
of questions in discourse depending on the discourse structure and the inter- 
actional patterns, we agree with Candlin (in Coulthard 1977:xiir) that "learners 
.ieed to begpme analysts of discourse themselves," 

We would suggest that transcribed passages of task-oriented discourse from 
various English-speaking groups, including both native and non-native speakers 
in various combinations, along with the taped record, be studied by both ESL 
teachers and their students. We would envision as a first step the study of a 
passage which has been annotated in terms of discourse structure and selected 
language functions — for example, here this person is presenting reasons, here 
another is challenging someone, here they are exploring such and such an issue, 
here one person is trying to lead the others towards consensus, etc. A follow- 
up step would be to have students work closely with other, similar passages 
which have not been annotated, figuring out what interactions are going on 
O 
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between the participants, summarizing the ideas 1 of various speakers and talking 
about why a particular participant asks, for example, a particular question and 
how that question and the response moves or does not move the substance of 
the discussion forward. 

We would hope that in this way the language, teacher and learner world 
gradually become sensffeve to the range of possibilities that the language offers 
and the circumstances most commonly associated with the various options, and 
that in an informal ancj fairly indirect way, the learner would develop that 
systematic versatility which marks the communicatively competent person. 
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Oral and Written Syntactic Relationships in 
Second Language Learning 

Roberta J. Vann 

lowd State University 

This paper reports on a study which examined the oral and written language 
of second language learners. The study had two main objectives: to discover 
more about the relaiofShip between the oral and written syntactic units and 
structures of non-native speakers of English and to examine the reliability and 
utility of certain widely used indices of syntactic maturity. The experiment in- 
volved 28 Arabic-speaking adult subjects who were studying ESL. Comparable 
oral and written data samples were elkrted by having subjects respond orally 
and in writing to a short film without narration. Transcribed data were analyzed 
using five indices of syntactic maturity. Teacher^aluators also assessed oral 
and written data samples as a validation procedure. The indices were examined 
for their value as indicators of target language proficiency by comparing them 
with subjects' scores on a standardized test of English proficiency. Results sug- 
gest that second language learners tend to differentiate English oral and writ- 
ten expressive style? in ways similar to native speakers with written discourse 
containing fewer words, longer mean T-unit* length, and fewer errors of certain 
kin is than comparable oral data. At the same time, this study casts doubt upon 
the usefulness of the T-unit and dependent clause ratio, indices which dis- 
regard learner errors, for use as measures of syntacfic maturity of non-native 
speakers. 

Although the link between oral and written language has been a topic of 
philosophical debate for centuries, this issue has elicited relatively few empirical 
investigations. The dearth is especially apparent in the area of applied lin- 
guistics. This is p>, bably due, at least in part, to the until recently widely 
accepted notion of the primacy of speech and the consequent neglect of written 
language. For the pillars of linguistic thought the situation was clear: only 
speech was truly language and writing merely a means of recording it—certainly 
not a system in its own right. This attitude is exemplified by Wilhelm von 
Humboldt's statement which referred to writing as "only an incomplete mumi- 
fied depository which needs, for tall understanding, an imaginative oral recon- 
stition" (Salus, 1969:184). Other linguists who followed suit included Saussure - 
(1959), Sapir (1921), Blomfield and Hockett (Vachek, 1973- 1041). Over the 
., years, other linguists and psychologists have tended to agree with the Humboldt 
school, some to the extent of confusing speech with language and/or thought. 
This tendency drew the following comment from Vygotsky: 

We can trace the idea of identity of thought and speech from tiu speculation of 
psychological linguistics that thought is 'speech minus sound' to the theories of 
modern American psychologists and reflexologists who consider thought a reflex 
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inhibited in its motor part. In all these theories the question of the relationship be- 
tween thought and speech loses meaning. If they are one and the same thing, no 
relationship between them can arise. Those who identify thought with speech simply 
close the door on the problem (1962:2). 

> . ! 

In a similar way, those who have reduced writing merely to speech' written 
down also have attempted to shut a door on an important issue. In spite of 
notable exceptions to this conception of language (Vl§!fck, 1973; Smith, 1975; 
Householder, 1971; and Bolinger, 1975) all of whom addressed the differences 
in oral and written language and dealt with writing as a system in its own right, 
it has been the view represented by behaviorists with their emphasis on verbal 
behavior and structuralists with their stress on speech as primary which has 
had a significant impact on foreign language teaching, particularly in the audio- 
lingual approach in which the student is typically not allowed to write until 
after he/she has gained control over the structures orally. The assumption is 
that oml and written language exist linearly (oral proceeding written) with the 
implication that this is the natural developmental order for adult second lan- 
guage learners as well as for children acquiring their first language. Likewise, 
some versions of the direct method, in attempting to simulate the environment 
of first language acquisition, have maintained that speech in the target language 
be mastered before literacy skills are tackled, This rationale of natural older 
^has been argued against by Ausubel who states that it is in fact . . unnptural 
to assume that after an individual becomes literate, he will learn in the same 
way as 1 en he was illiterate* (1964:423). Ausui^l's view is echoed by propo- 
nents oi >gnitive-code theory which encourages learners to use all their senses 
in assimilating material. Thus the cognitive-code approach, as did direct 
method, grammar Uanslation, and audiolingual, makes an implicit assumption 
about the relationship of spoken to written language. Yet only a shallow em- 
pirical foundation for such assumptions exists. 

Of the studies which have attempted to examine this relationship, most 
have been with children acquiring their native language. For that reason such 
work served as a basis for the present stucfy. Inevitably research which examines 
syntactic relationships as does this one, must tackle the problem of finding an 
appropriate index of proficiency such as nj^an length of Utterance (MLU), 
widely used by psychologists in studying first language learners, or the T-unit. 
The latter has been utilized by Hunt (1965) and others as a measure of syn- 
tactic maturity in English-speaking school children and later by second lan- 
guage learning researchers including Monroe (1975), Cooper (1976), Gaies 
(1977), Larsen-Freeman and Strom (1977) and others. 

The controversy surrounding the matter of what constitutes an adequate 
measure and the problems involved in refining this index, although also the 
concern of the present project, are not the central focus of this paper. Rather, 
1 a reporting here on the findings iri^his investigation which deal specifically 
with .he relationship of the, oral and written syntax of a group of adult second 
language learners at varying stages of their English language development/ By 
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collecting samples under carefully controlled conditions, the two products have 
Twwfoomparedf in an attempt to discover more about the relationship in itself 
as well as howTNnay relate to what we know about the oral/written deyeloo- 
meot of Li. * \ 

L Subjects \ 

Twenty-eight adult male Saudi Arabian subjects who had spent approxi- 
mately five months in the United States and were enrolled in post-graduate 
courses in education, as well as ESL courses, provided the data used in this 
prefect. TOEFL scores, which were collected as a means of validating the 
language proficiency of die subjects thus enabling a realistic interpretation of 
the results oJ this study, as well as allowing for the possibility for future studies 
replicating this one, ranged from 337 to 570 with a mean of 420. The average 
subject was about 30 years old and had studied English in Saudi Arabia for 
about seven years. None of the subjects had ever previously lived in an English 
speaking country. 

2. Procedures 

jn order to elicit comparable oral and written data, I sought an elicitation 
device which would naturally evoke a monologue. Largely for this reason, F 
chose a motion picture stimulus. This also follows the suggestion made by 
Scott and Tucker in an article concerning their study on oral/written relation- 
ships in L» (1974:96) in which they stated that a film would be preferable to 
still pictures in encouraging subjects to be expressive. Silences, a 12 minute 
color film without narration has the added advantage of being of a subject 
matter, which while of adult appeal, is a film none erf the subjects had seen. 
Nor had they been exposed to the plot— a story set during World War II in 
which a Yugoslavian peasant aids a wounded German enemy soldier. 'Because 
of its emotional intensity and thought-provoking final scene, the film evoked a 
strong interest in subjects and thus appeared to increase the likelihood of their 
wanting to respond to it. This was an important concern in choosing the stimu- 
lus. In the past, language researchers have sometimes neglected the need to 
present subjects with material which truly stimulates language production. 

The subjects watched the film in groups of six. Following the film, three 
of the subjects were chosen at random to preceed immediately to one of three 
interview rooms where each subject told an interviewer who had not seen the 
film what the film was about and what his response to it was. Responses were 
recorded on an unconcealed tape recorder. At the end of approximptely 10 
minutes, interviewers sent the subjects to complete the writing phase of the 
experiment. Meanwhile, the other three subjects had remained behind and had 
been instructed to write a composition in which they told someone who had 
not seen the film what happened in it and their own opinion of it. These subjects 
were given approximately 20 minutes to complete the task before being sent to 
perform the oral phase of the experiment. By having half of the subjects write 
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first while half spoke first, the possible effect of sequence could be examined. 

Following data collection, oral recordings were transcribed orthographi- 
cally, using conventional punctuation, then checked and verified by a phoneti- 
cian who looked for possible discrepancies between the tapes and typed tran- 
scriptions. Finally, five minute segments were extracted and the transcriptions 
were re-typed. Written compositions were simply typed adhering to subjects' 
spelling, spacing and punctuation. 

After excluding certain data considered extraneous, transcriptions were 
marled for T-units, mazes, dependent clauses and error-free T-units. The T-unit, 
the basic method of segmentation in this study, is defined by Hunt (1965:21) 
as eaqh utterance grammatically capable of being considered a sentence. Thus 
all simple and complex sentences constitute one unit, while all compound sen- 
tences are sliced with the coordinate conjunction forming the first word of the 
second unit. Thus: 

The man is a soldier/ and he wear^ a uniform (2 units) 
The man is a soldier who is wearing a uniform ( 1 unit ) 

Words or strings which do not constitute T-units were labeled mazes. A 
maze, for the purposes of this study, is similar to what some researchers have 
referred to as a garble. Typical examples are semantic redundancies, unintel- 
ligible words or strings, and false starts. T-units were considered error-free if 
they made sense in the given context and were free from syntactic and lexical 
error. (In the case of written samples, spelling and punctuation were disre- 
garded as criteria in determining whether or not a unit was error-free unless 
the spelling made a difference in meaning or grammatical correctness, e.g. draw 
for drew would have been tabulated as an error. This is closest to the procedure 
used by Scott and Tucker rather than Larsen-Freeman in similar studies. The 
implicit assumption here is that spelling and punctuation are "purely scribal 
conventions 0 (Hirsch, 1977:163) which probably occur at a relatively late stage 
in the writing process as the individual moves from an idea to a final grapholog- 
ical form. 

3. Findings 

In addition to computer analysis using the syntactic indices, the oral/ 
written relationship was examined through holfstic evaluations of the data by 
teacher-raters Three highly-trained ESL teachers were asked to evaluate the 
oral and' written data-in 2 separate sessions according to their usual criteria for 
judging EFL composition. The procedure was a simple one: raters were asked 
to group^ata into 3 stacks according to the level of proficiency represented. As 
can be seen from Table 1, the raters tended to evaluate oral and written samples 
similarly to the way in which they had judged written ones. As can be seen 
from the high concentration of subjects along the left-right diagonal, those who 
were rated high, average or low one task were likely to retain this ranking for 
the other task as well. 
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TABLE 1 

Raters' Groupings of Subjects on Oral 
and Written Assignments 



Row 

Groups I n HI Total 

17 3 0 10 

U 2 4 3 9 

m 2 1 6 9_ 

Column 

Total 11 8 9 28__ 



Because judgments on the two sample groups were not truly independent 
in that the same raters judged each set, it cannot be concluded that good 
speakers make good writers or vice-versa. In fact, even if Independence of rate* 
judgments could be established, because the study was designed to disregard 
certain language features such as pronunciation *n the oral samples and spelling 
and handwriting in the written ones, conclusions about transfer of proficiency 
from speaking to writing are necessar!**' limited. 

The raters were also utilized ir, determining whether sequence of oral/ 
written task influenced subjects' performances. Chi squared tests indicated that 
rater assignments on both oral and written assignments were independent of 
sequence. This finding shows that at least in terms of the conditions set up in 
this study,, evahiators could detect no influence on a subject's oral performance 
as a result of his having written about it first, nor on his oral performance as 
a result of his having spoken about it previously for 10 minutes. This finding 
also allowed all subjects to be pooled together as one group in subsequent 
analyses. 

These syntactic analyses revealed that the relationship between oral and 
written expression was apparent not only in the raters' holistic evaluations, but 
in terms of mazes, error-free T-unit length, and percentage of eiTor-free T-units 
(see Table 2). This rneafis, for example, that a subject's oral mean length of 

TABLE 2 

Correlations between Oral and Written Indices 



Indices 


R 


P 


X T-Undt Length 


.100 


.16 


% bependent Clauses 


.186 


.17 


% Maxes 


.547 


.00 


X Length EF T-Units 


.457 


.00 


* EF T-Unit/T-Units 


.518 


.00 



error-free T-units is likely to be reflected in his written mean error-free T-unit 
length. But the reader will note that for two of the indices, mear T-unit length 
and ratio of dependent clauses, there was no such correlation between oral and 
written performance. While it is possible that this result is due to the inad- 
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ejuacy of the measures used with ESL subjects, it cannot be said with assur- 
ance that $ood speakers are good writers or vice-versa. 

Since a portion of this^ study was devoted to examining these language 
proficiency indices. I decided to use multiple regressioi step analysis to deter- 
mine which indices correlated with TOEFL scores, and whether the oral or 
written version was the better correlate in the case of each respective index. 
I. also used multiple regression step analysis to determine the two best cor- 
relations with TOEFL scores, using first oral and then written data, I found 
that the oral ratio of mazes to words, the oral and written mean length of 
error-free T-units, and the oral ratio of error-free to total T-units correlate 
moderately with TOEFL scores. I also found that for use with the written 
sample, mean length of error-free T-units combined with ratio of error-free 
T-units is the most valuable discriminator, ni!e for the oral sample, the rat » 
of error-free T-units combined with oral mean length of T-units accounts for 
the largest amount of variance. Note however that when combined with the 

TABLE 3 

Mi<mgi*!»t hulk os Correlating with TOF.FL Scenes 
for Written and Oral Data 



Written * \djusted 







Multiple R 


R 


R 3 


Step 1 


Mean length of 
Error-Free T-Units 


.583 
(p = .0O) 


556 


.310 


Step II 


Ratio of Error-Free 
T-Units to all T-Units 


.607 
(p = .00) 


.555 


.309 


Oral 








Adjusted 






Multiple R 


R 


R 2 


Step t 


Hatio of Error- Free 
T*Units to all T-Untb 


.590 
<p = .00) 


563 


,318 


Step II 


.Jean length of T-Units 


(p=.00) 


.609 


.371 



previous index, it slightly lowered the amount of variance accounted for. 
Thus for written data, the error-free T-untt, especially its mea* length, ap- 
.pears to be the best indicator of adult foreign language proficiency. It should 
be remembered that the 31 percent variance figure is deceptively low, since the 
TOEFL score represents an individual's overall English proficiency including 
listening comprehension, reading skills, and so forth. Thus ever if a perfect 
measure of n syntactic proficiency existed,, it would be unable to account 
for all the variance in the overall TOEFL score. 

In terms of the oral data, we note that ratio of error-free T-units to total 
Tmnits proved the strongest correlation with TOEFL scores, accounting for 
almost 32 percent of the variance when used alone. If oral mean length of 
T-units is added and used in combination with the ratio of error-free T-units, 
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the amount of variance accounted for ris^ to 37 percent. Thus the mean length 
of T-units becomes a more useful index when combined with the ratio erf error- 
free T-units. Again this suggests that syntactic length does function as a measure 
of language proficiency, but that errors need to be accounted for in order for 
it to show power as a discriminator. # 

In summary, those indices which take errors into account appear to be 
the best discriminators of language proficiency. It is interesting to note, how- 
ever, that mean length of error-free T-units discriminates in terms of subject 
language proficiency similarly to the way in which mean length of, T-units 
seems to discriminate according to the language ^ficiency of a native-speaker. 
.Thus the Jength of a syntactic unit, when errors are taken, into account, does 
have value as a discriminator of language proficiency for the individuals in 
this study, and results from other studies support the conclusion that this has 
wicler application to other groups of language learners. (See Monroe, 1975 and 
Cooper, 1976). 

As a follow-up study, the question was posecUas to how raters had evaluated 
the oral and written samples in relation to each of the five oral and five written 
syntactic indices in this study. Woul^ results be similar to those using the 
TOEFL as the criterion variable? In other words, would the same indices 
prove to be the strongest discriminators^ This did, in fact prove to be the case, 
with mean length of T-units and percentage of dependent clauses failing to' 
discriminate between the high and low proficiency groups at a statistically 
significant level. The other three indices.. however, were statistically significant 
as discriminators. 

This suggests that when raters are instructed to use their own criteria for 
evaluation, they tend to focus on a subject's errors in combination with his 
T-unit length in determining the level of his performance. On the other'liand, 
the same raters do not appear to be substantially influenced by a subject's T-unit 
length or the percentage of his dependent clauses in his discourse. Once again, 
there is evidence that such indices, while functional in discriminating proficiency 
levels of native-speaker discourse, do not work effectively with non-native adults 
learning English, at least with the subjects in the present study. 

In addition to the findings discussed thus far, certain aspects of the analysis 
resulted in information which lends itself io comparison with findings from first 
language acquisition studies— specifically those which have compared the oral 
and written discourse of English-speaking school childnu (O'Donnell, Griffin, 
and Norris, 1967; and Loban, 1976). 

1) Oral compositions 'were almost tWice as long as w itten ones, although 
oral segments represented only one-fourth the time allotment of written ones 
(5 minutes of oral taping versus 20 minutes of writing). Hie oral mean number 
of words produced was 422.9, while the mean number of written words pro- 
duced was 214.8. These results are similar to those of other studies which have 
compared oral and written discourse. However, the discrepancy may be even 
greater with this group of fc&L learners than with native-speakers. 
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2) Mazes were plentiful in speech and rare in writing. This is similar to 
Loban's finding with native-speaking school children. 

3) Both mean length of T-units and mean length of error-free T-units 
were longer in written than in oral discourse. O'Donnell, Griffin, and Norris 
( 1967:8J ) report that in this study with native-speaking children, mean length 
was greater in oral than in written expression until Grade 3 after which the 
trend reversed. Thus the subjects in the present study appear to have more 
in common with O'Donnell s post Grade 3 subjects. 

4) The ratio of adverbial and adjectival clauses to T-units was higher in 
written than in oral language, especially in the case of adjectival constructions, 
the percentage of which more than doubled in written expression. 

These findings suggest that the adult foreign language learners in this 
study, similar to native-speakers in previous studies, tend to use a more elabora- 
tive and> more strictly controlled written than oral style. It is elaborative in the 
sense that T-units are longer with a larger percentage of adverbial and adjec- 
tival clauses and more strictly controlled in that fewer words are produced in 
a given time allotment with mazes representing a very small proportion of these 
words. Still open to speculation, however, is the question of how much this is 
a function of language training — in the case of the subjects in this study train- 
ing in English in a Western country which persumably exposed students to a 
more elaborate and finely controlled written style as opposed to a more free 
oral style characterized by shorter syntactic units. 

Yet, for whatever reasons, it does appear that evidence from the oral/ 
written language relationship of these subjects offers some support for the 
similarity of first language acquisition and second language leurning even when 
the two groups are widely removed in age, native language, and cultural back- 
ground. 

4. Implications 

While lending additional support to the notion that second language learn- 
ing closely approximates the developmental sequence of first language acquisi- 
tion, the present study also opens up a vast range of further research questions 
from the merely interesting {e*g. how universal might this phenomenon be?) to 
the pragmatic and pedogogieal (e.g how might we help good speakers become 
better writers?). 

Unfortunately, we are not likely to discover the answers to such questions 
until we have developed better methods of data evaluation — both through 
holistic rating and ohjeetise means. Thus the goal of developing more refined 
methods of measuring language proficiency must be among our highest priorities 
in future language research. 
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This paper describes the results of a computer-assisted analysis of 152 com- " 
positions written over a period of seven weeks by 14 lower-intermediate level 
EFL students. Measurements on these compositions estimate changes in gram- 
matical complexity, grammatical error frequency, grammatical error type, flu- 
ency, vocabulary size, spelling, and punctuation. Such measurements provide 
an objective look at some short-term changes in the writing skills of adult 
learners of English. 

Single compositions are unreliable representations of writing proficiency, 
but comparisons of average measurements on several compositions reveal a 
number of changes in composition skills. Several such changes depend on the 
first language and culture of the learner. A comparison of composition changes 
with changes in placement test scores indicates that some important develop- 
ments in riting are not reflected by standard discrete-point tests. A, compari- 
son of measurements for eighteen compositions with ESL teachers' subjective 
ranking of those same compositions shows that teachers consistently base their 
rankings on the same limited set of criteria. 

CK'er tr*s past several years I have taught writing skills to adult foreign 
reamers oi English. Like most such teachers, I have been aware of substantial 
differences of ability among the students in each class. Some were atrocious 
spellers; rubers spelled English accurately. Some r **de many grammatical 
errors, while others were able to eliminate or avoid rtually alj such errors. 
Some organized their writing in a way that matched my American sense of 
order; others wrote in ways that seemed to me disjointed and illogical. Again, 
like most teachers of writing, I had an intuitive sense that I was helping my 
students to improve their writing. Although all students had off-days, I sensed 
some improvement between the beginning and end of an eight-week intensive- 
course session. But until last year, I had no concrete evidence to support my 
intuitions v Changes that I sensed were subtle rather than dramatic. The bad 
spellers were still bad spellers, but maybe not quite so bad. The same gram- 
matical errors occurred, but perhaps not quite so often. 

About a year ago, when I was teaching a class of '14 lower-intermediate 
adult students of English as a foreign language, I decided to measure, with as 
much accuracy and objectivity as possible, some of the changes that occurred 
in the writing of the students in my class over a period of seven or eight weeks. 
What follows is a description of those changes. This description is based on a 
computer-assisted analysis of 152 individual compositions. For each of these 

* I am grateful to Ann Borkin^for suggestions that led to several revisions. 
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compositions I was able to obtain an approximation of vocabulary size, fre- 
quency of errors in grammar, spelling, and punctuation, writing speed, and the 
grammatical complexity of sentences. One of my objectives was to determine 
whether any of these measurements suggested changes in student performance 
over the eight weeks of the course. Another objective was to determine whether 
these measurements were related to the sojte of intuitive judgments that com- 
position teachers make about their students' writing and also whether changes 
in these measurements corresponded with changes in standard language- 
achievement test scores. 

1. Method 

The students. The information presented here is, 1 believe, an accurate 
representation of some changes in my students' n rittm^tfnjrlllh but you must 
decide whether for your purposes my students should be considered "typical". 
In age, almost all were in their late teens or twenties. Only one student in 
the class wa^in her thirties. The class represented a variety of different coun- 
tries in Latin America and the Middle East; one student was from Japan. 
All but three of the students spoke either Spanish or Arabic. They had come 
to the United States td study English in preparation for study at an American 
university. 

Students were placed in my class based on an objective test measuring their 
grammatical knowledge, their knowledge of English vocabulary, their reading 
and oral comprehension. This test did not measure writing* ability directly. 
When my students were given a similar test at the end of the eight-week 
session, all but one had improved enough to be advanced to the next course 
level. * 

During the term, students wrote twelve timed in-class compositions. Topics 
for these compositions alternated between descriptions of short animated films 
and other topics based on the discourse strategy treated during that week — 
chronological order, cause-and-effect, and so on. These compositions, a total * 
of 152, constitute the data for my analysis. 

The measurements. Table 1 describes the nine measurements made on each 
of the 152 compositions. 

The average number of words per minule of writing time is assumed to 
reflect the water's fluency. Because I saw the care that each student gave to 
his or her writing, I am confident that writing speed does not simply measure 
the students" ability to spew out minimally coherent sequences of English 
words. The students were never pushed to write more or to *vrite faster. They 
took pains to write carefully. 

The length of T-units— that is, of single independent clauses together with 
all modifying subordinate clauses— was proposed by Hunt (1965) as a measure 
of syntactic complexity. T-units are generally lengthenpd uy sentence embed- 
ding or by the ellipsis of repeated elements in compound sentences. Hunt found 
that T-unit length reflected the verbal maturity of school-age writers. 
9 
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TABLE 1 

The Nature of the Measurements and the Abilities Measured 



measurement 



ability of writer 



I. 



Average words pet minute of w riting 
time 

Avenge length of T-units _ 
Percentage »of error-free T-ffcits 
Average length of error-free T-units 
Type-token ratio 

Average number of grammatical errors 
per 10Q words 

Average number of misspelled words 
per 100 words 

Percentage of T-units ending with a 
punctuation mark 

Semantic errors as a percentage of total 
errors \ 



fluency, articulateness 



2. 
3. 
4. 
5. 



grammatical sophistication 
grammatical sophistication 
grammatical sophistication 
size of active vocabulary 



7. 



ability to correct or avoid grammatical 
errors 

ability to spell English correctly 



9. 



ability to punctuate written English 
appropriately 

seriousness of grammatical errors 



Since adult learners of English often write long T-units by translating 
sentences from their native language word-for-word into English or by using 
interlanguage rules to combine English words, longer T-units in the writing 
of fore'qn adults may not indicate a more sophisticated control of the gram- 
matical constructions of English. To overcome this difficulty Scott and Tucker 
(1974) use a different measure of grammatical sophistication: the percentage of 
T-units that are free of grammatical errors. 

Larsen-Freeman (1978) describes yet another measure of the grammatical 
sophistication of foreign learners of English. Like Hunt, she calculates the,. ' 
average length of T-units, but unlike Hunt, she includes in her calculation only 
those T-units that are error free. 

A type-token ratio is frequently used to estimate active vocabulary size. 
The variant of-that measurement used for this study is the number of word 
types (i.e. different words) divided by the square root of twice the total num- 
ber of words in the text. Thus, if the writer used the same limited number of 
words over and over again, the type-token ratio for his/her composition would 
be a relatively small number; whereas the type-token ratio for a writer using 3 
a wider variety of words would be a f larger number. 

The frequency of grammatical errors in each composition was measured 
as the average number of errors per 100 words of text. As in any error analyjis, 
these ar ~rs from the viewpoint of a native speaker. Presumably if the 
learner* lad perceived them as errors, he or she would have corrected 



Spelling proficiency was estimated by the number of misspelled words per 
100 words of text. 

A rough measure of the writer's capacity to punctuate English sentences 
Svas provided by calculating the percentage of T-units ending with a punctua- 
tion mark. (Virtually all T-units require some sort of terminal punctuation*) 
This measurement, however, does not reflect the writers ability to provide 
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necessary punctuation within T-units, nor does it take into consideration tie 
appropriateness of the particular punctuation mark used at the end of each 
T-unit. 

Some errors distort or obscure the meaning of a sentence. Such an error 
is committed, for example, when a learner writes ihe verb Trilled with the 
intended meaning died or uses present tense in reference to past events. Oh the 
other hand, when a writer fails to make the subject and verb agree* or puts an 
adverb in the wrong place, or hopes to doing rather than hoping to go, he 
proclaims his foreignness, but does not distort his meaning. Semantic errors, 
errors that distort meaning, are more serious. Consequently a decrease in the 
ratio of semantic errors to total errors indicates an improvement in the quahty 
of the learner's errors. 

Eac# measurement in Table 1 is associated with an ability generally con- 
sidered important in the writing of English. Note, however, that these abilities 
are not measured directly. Such things as fiuency and grammatical sophistica- 
tkm. can not be counted or calculated. Consequently, references below to 
changes in these abilities are only as valid as the association between each 
ability and the corresponding measurement or measurements Note in addition 
that some of the most important concerns of writing teachers are not included— 
such things as organization, relevance, tone, and cohesion. These are missing 
because I was unable to measure them objectively, and such omissions limit 
the scope of this analysis of student writing. . 

2. Discussion • 

Tb^e procedure , that revealed changes in composition skills most clearly 
turned out to be one of the simplest. The twelve compositions were divided 
into two groups: those written during the first half of the course and those 
written during the second half. For each half, the individual composition scores 
for each student were averaged. Then these -average scores for each of the 
fourteen students in the class were themselves averaged, arriving at a single set 
of average or, mean scores for the first six compositions and another set of 
average scores for the last six. These scores are presented in Table 2. 

There was clearly" significant improvement in three of the nine measure- 
ments comparing the first six compositions with the last. A significantly higher 
percentage of T-units were punctuated in later compositions. As indicated by 
the type-token ratio, students used a significantly larger vocabulary in writing 
the later compositions. Finally, later compositions were written slightly more 
ttpidly, an increase in writing speed averaging about one word per minute. 

Two other measures were within five percent of the accepted .05 signif- 
icance level, and therefore suggest some other likely changes in the Writing of 
my students. The level of misspelling decreased by roughly .8 words per 100 
and between three and four percent more of the T-miits in the later composi- 
tions were error free. 

There was no significant change in the frequency of grammatical errors. 
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TABLE 2 

First HaK— Last Half Comparison of Average Values 
for Nine Measurements — All Students 



measurement 

1. errors per 
100 words 

2. aveaaje words 
per T-imit 

3. error-free 
T-unft 
frequency 

4. average words 
per error- 
tree T-unit 

5. terminal 
punctuation 
frequency 

6. type-token 
ratio 

?. writing 
speed 

8. frequency, of 
misspelling 

9. semantic 
error fre- 



first 6 

compo- 
strjons 


last 6 

compo- 
siikjiu 


difference 
{last— 


sigiiif* 
ioance of 

uintpi dins 


17.35 


16.77 


-.58 




10.56 


9.99 


-.57 




25.21 


28.89 


3.68 


_ 

C 


6.43 


6.23 


-.20 




80 JO 


45.02 


4.13 


- b 


431 


4.47 


,16 


a 


5.84 


7.02 


LIS 


a 


7.66 


7.04 


-.62 


c 


36.24 


34,63 


-1.61 





(N = 14) 

Ona UH#<t paired T int 

• p < .01 - 

* p < .05 , , 

/ 

This result surprised me since a major portion of clajs time was spent in* 
error-related exercises and discussions and since students were required to 
rewrite their compositions correcting grammatical errors. Again contrary to 
my expectations, the average length of T-units did not increase. If anything* 
it seems to have decreased since the average length of all T-units and the 
average length of error-free T-units both decreased slightly. 

-Among the students in my class there were six Latin American students 
whose native language was Spanish and five Near-Eastern students who spoke 
Arabic as a native language. It was therefore possible to compare the per- 
formance of these two groups over the course of an eight-week period when 
they Here receiving virtually identical training in English. 1 

Table 3 compares the average proficiency level of these two groups of" 
students on each of the nine tests. The results are not surprising, Spanish 
speakers were significantly better at spelling and punctuating English— almost 
certainly due to the consistent similarities between the writing systems of 
Spanish and English and the equally consistent differences between the graphic 
conventions of Arabic and English. Spanish speakers also used a larger voca- 
bulary in their writing, as one would expect given the vast number of English 




courses 



^hls training included four intensive courses (reading, writing, sneaking and' listening), 
a total of 20 hours per week, This particular group of 14 .students worked together in all four 
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TABLE 3 * 
Comparison of average Scores for Arabic and Spanish 
speakers on nine measurements 



nu^ti«mw^ Native language^ Significance 
measurement Arabic Spanish of difference 

L anon per 16.28 17.41 

100 



1 averoae words 10.63 10.49 
par T-unit 

3. error-free 28.08 27.68 
T-unit 

frequency 

4. average words 6.21 0.25 
per error- 
free T-unit 

5. terminal 72 .86 86.13 b 
punctuation 

frequency 

6. type-token 4. 18 4.56 



ratio 
speed 

* of 1318 3.84' 

misspelling 

9. semantic 33.24 36.45 

error fro* 
quency 

Nf (5) (6) 

Two-Ull#d T-tsst 
*p<.05 



b 



Varitne** for the meu uretnent of nii*ip«llinf frtqtuncy art $tou\j upoqual. Two ArtWc-ipf»klBf 
*tad*nU h*d MYer* tpclUnf dlttcultlM; sit otktr •tud«nU w«r* r«*ton*bljr good iptUin, 

Cognate and loan words in Spanish. On all of the .other tests, the performance 
of Arabic and Spanish speaking students was not significantly different. 

But the changes in Writing skills that occurred during the eight-week course 
were strikingly different* These differences are represented in Tables 4 and 5. 
As Table 4 indicates, significant changes in the writing skills qt the Spanish- 
speaking students were their increased writing speed and their increased ca- 
pacity to atfoid semantic errors. Also notable were improvements in spelling 
and in the frequency of error-free, T-units. However, there was a possible 
decrease in the average length of error-free T-units. It is possible that Spanish 
speakers in the class were able to write more error-free T-units partly because 
they were writing shorter T-units. 

As Table 5 indicates, none jQftfie changes in scores of Arabic-speaking stu- 
dents were similar to changes amoHg the Spanish-speaking students. Instead, 
Arabic-speaking students improved Significantly in their ability to punctuate 
English sentences and in the size of/Aeir active vocabulary. The writing of both 
groups changed, but in quite different ways. 

Characteristics of the learners first language have long been considered a 
source of his or her generalizations about a second language and consequently 
a source of learner errors. The present data suggest that differences in language 
or culture (I cannot determine which) also affect the direction and nature of 
the language learning process. Different cultural groups may require different 
O • 
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TABLE 4 

First half*— last half comparisons of average values 
for nine measurements — Spanish students 





first 6 


last 6 


difference 


signif- 




compo- 


compo- 


(last — 


icance of 


measurement 


sitions 


sitions 


first) 


difference 


*. errors per 


17 Q1 


i a oi 


— .HO 




10n u/nrrla 

iw wuiua 










iVCIMHC WUllU 


10 R7 


in 07 

lU.Ul 


Ail 




per i -unit 










«j. cirur*iTce 


OK ill 


oU.l I 




c 


T* unit 

I -unit 










frequency 










4. nve rage words 


O.Oo 


K Q1 


AA 
— .00 




per error- 
irec l ■ tin ii 


- 
















5. terminal 




Qfi in 


4.01 




mm etna Him 










r ran t lan^v 
I idj UC1H 7 










fl frVTV-tflKPfT 


4.49 


4.64 


.15 




ratio 










# « writing 


A 91 


/.04 




L 

D 


speed 










8. frequency of 


4.13 


3.54 


—.59 


c 


misspelling 










9. semantic 


39.43 


33.39 


-6.04 


a 


error frc- ' 










quency 










(N = 6) 










One tailed paired 


T test 








■ p < Ot 










*p< 05 










c p< I 




TABLE 5 







First half — last half comparisons of average values 
measurements — Arabic students 



measurement 



first 6 
compo- 
sitions 



last 6 
compo- 
sitions 



difference 

Jblast— 

*first) 



signif- 
icance of 
difference 



1. 


errors per 
100 words 


16.16 


16.35 


2. 


average words 
per 1-umt 


11.02 


10.26 


3. 


error-free 

T-uijit 

frequency 


23.83 


28.50 


4. 


average words 


6.28 


6.18 




per error- 
tree T-tinit 












5. 


terminal 


68.50 


76.76 




punctuation 








frequency 






6. 


type-token 
ratio 


4.11 


4.26 


7. 


writing 
speed 


5.73 ^ 


6.62 


8, 


frequency of 


13.86 


12.54 




misspelling 




33.65 


9. 


semantic 
error fre- 
quency 


31.97 


(N 


- 5) 








Pirn tailed paired T teit 








*p< 05 







.19 
-.76 
4.67 

-.10 

8.26 

.15 

.89 
-1.32 
1.68 



b 
b 
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curricula to compensate for differing tendencies to develop in some writing 
skills but not in others. 

To determine the consistency of teacher judgments in the ranking of com- 
positions, I compiled a set , of eighteen compositions all written on the same 
subject by students in my class. These compositions were given to a total of 
13 ESL teachers and teaehers-in-trainiug who were asked to rank the composi- 
tions from best to worst. No specific criteria for making judgments were sug- 
gested; tie rankings were accepted. 

' These teachers were surprisingly consistent in their rankings; there was 
general agreement as to the ordering of compositions roughly 60 percent of 
the time. 2 The teachers themselves were surprised by their consistency; they 
found very difficult the task of ranking compositions so similar in quality. 
Yet all of the teachers seemed to be using roughly the same criteria in judging 
compositions. What were those criteria? More particularly, were any of the 
writing skills considered important by teachers captured by the the nine mea- 
surements used for this study? 

In fact, three of the nine measurements were related to the rankings given 
by the teachers; they are. in decreasing order of importance: writing speed, 
frequency of grammatical errors, and frequency of spelling errors. 3 There was 
a correlation of .91 between these three measurements considered together and 
the average teacher- ran kings. By far the single most important predictor of 
teacher rankings was writing speed, that is, the length of each composition. 
* It would be uncharitable, however, to assume that composition teachers were 
judging quality simply by length. I suspect that students who could write faster 
also wrote more articulately, and that it was this less tangible characteristic to 
which teachers were attending. 

Because all of the compositions ranked by the teachers were written on 
the same topic, it was possible to obtain at least a rough measurement of the 
relative amount of information conveyed by each writer. I guessed that this 
measurement, in addition to the original nine, might.be associated with the 
judgments made by teachers in ranking compositions, but in fact it was not. 
Apparently teachers, in making their judgments, were attending more to the 
form of writing than to the completeness of its content, 

In ranking papers written by my students, teachers were attending to char- 
acteristics associated with length and with the frequency of grammatical and 
spelling errors. It seems that the sorts of changes that occurred in the writing 
of the Latin American students were valued more highly by writing teachers 
than the sorts of changes that occurred in the writing of Near-Eastern students. 
One might expect that these teachers would consider Latin American students 
better writers of English than Arabic students. But such generalizations are 

' Baaed on an analysis of principal components using individual teacher rankinm as 
variables. 

3 Based on a statistical procedure that selects independent variables for a regression 
model. 
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dubious. Had the. compositions themselves been different — written on a dif- 
ferent topic or by students more divergent in ability — the criteria used by 
teachers to evaluate them might also' have been different. 

Although teachers were surprisingly consistent in their judgment of my 
students 9 compositions, my students were inconsistent in the quality of their 
writing. As a consequence, a single composition did not provide reliable evi- 
dence of a students writing ability. Figures 1, 2 and 3 illustrate the problem. 
These three graphs show the range of scores over twelve compositions for a 
single student on each of the three measurements most closely associated with 
teacher judgments, that is, speed, grammatical error frequency, and frequency 
of misspelling. 

Although* a comparison of the writing speed of the first composition repre- 
sented in Figure 1 with the speed of the last would suggest a dramatic change 
in writing speed, this comparison would be misleading. Looking at the pattern 
of change over all of the compositions, it is difficult to decide whether speed 
has increased. Ignoring the first and last compositions, it would almost seem 
that speed is decreasing. Similarly, the drastic drop in error frequency in 
Figure 2 between the first and last composition does not, as the rest of the 
graph shows, indicate a gradual decline in the number of errors made by this 
writer. The decrease in errors is only apparent in the last two compositions. 
In Figure 3, the fluctuation among compositions totally obscures any gradual 
change that might be occurring in spelling proficiency. 

This sort of evidence casts doubt on the practice of evaluating a student's 
writing proficiency based on a single composition or of evaluating changes in 
student writing by considering a single pair of compositions. Unfortunately, ac- 
curate evaluation of student writing skills seems to require a larger set of com- 
positions as evidence. 

Students were placed in my class on the basis of a multiple-choice lan- 
guage proficiency test measuring grammatical knowledge, vocabulary size, and 
comprehension of written and spoken texts. It is often assumed that such tests, 
although they do not measure writing skills directly, do indirectly indicate the 
level of writing skill. My analysis of student compositions gave me a basis 
for testing that assumption. In particular, I was able to determine whifch if any 
of the changes in writing scores proved to be good predictors of changes in 
proficiency test scores. In fact, I discovered that of all the* nine measurements 
only vocabulary size was significantly related tQ scores on the proficiency test. 4 
Since part of Jhe proficiency test specifically measured knowledge of English 
vocabulary, trM association is not surprising. What is surprising is that none 
of the three measurements most valued by composition teachers was associated 
with proficiency test scores. Students with very similar proficiency test scores 
may differ in their mastery of important writing skills, and consequently when 
proficiency tests are used to place students in ESL classes, writing teachers are 
likely to have heterogeneous classes. 

4 Based oti a statistical procedure that selects independent variables for a regression 
© Jel. 
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FIGURE 2 

Changes in an individual student's 
grammatical error frequency (errors per 
100 words) over 12 compositions 
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FIGURE . 

Changes in an indiviuua 1 student's 
misspelling frequency (misspellings per 
misspelling frequent/ 100 words) over 12 compositions 
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In summary, the study of my students' compositions led me to five con- 
clusions. First, my students did improve with regard to some of the writing 
skills I was able to measure. The most notable improvements were in writing 
speed and in vocabulary size. Second, the writing of students in my class from 
different language and cultural backgrounds changed m different ways. These 
differences were due, I belk /e, to differences in the knowledge, experience/ and 
expectations that the students brought to the class rather than to differential 
treatment within the class itself. Third ESL teachew were surprisingly con- 
sistent in their evaluation of compositions written by my students. Moreover, 
they based their evaluations primarily on criteria related to writing speed, 
frequency of grammatical errors, and frequency of misspelling. Fourth, a Single 
composition provided an unreliable basis for judging the writing ability of an 
individual student. Similarly, comparing two individual compositions was 
unreliable procedure for evaluating change in composition skills over a period 
of seven or eight weeks. Finally, the objective tests used to evaluate the* lan- 
guage proficiency of my students and to place them in language classes did 
not measure their proficiency as writers. 

These conclusions are based on a single group of students all within a 
relatively narrow range of English proficiency, all roughly the same age and 
all studying together in the same ESL intensive -course. They are v in no sense 
a random sample of second language learners of English. Groups with different 
characteristics might not behave in the same way. I believe, however, that my 
study and my conclusions may be useful in directing further inquiry; and I 
suspect that at least some of my conclusions may prove to be valid in class- 
rooms other than my own, I hope, moreover, that my study will encourage 
others to continue the development of techniques for measuring short-term 
changes in the performance of language learners/ Only when such measures 
exist can teachers rationally evaluate the success of their teaching. 
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Syntactic Skill and ESL Writing Quality 

Patrick T. Kam««n 

Syracuse University 

A more thorough understanding of the relationship between syntactic skill and 
ESL writing quality — an understanding based on a solid body of empirical data 
rather than on vague intuitions — will better prepare composition teachers to 
help their ESL students learn to write. 

In this paper suggestions are made for replacing vague intuitions with 
objeotive measures of syntactic skill, with these suggestions based on the results 
of a recent exploratory study analyzing 40 syntactic factors in 50 compositions 
(25 "good" and 25 "rJoor" compositions) written by college-level ESL students. 

The differences l>etween the two groups of writers were simultaneously 
statistically significant at the .05 level in terms of (1) T-unit length, (3) clause 
length, and (3) incidence of dynamic passive constructions. 

The implications of these findings are discussed and suggestions are made 
for the use H such statistical counts^of syntactic factors, the wider use of 
sentence combining exercises, and a re-examination of present attitudes toward 
the passive voice. 

Intuition, rather than empirical evidence, has long been the basis upon 
which composition teachers have judged the writing skill of their ESL students. 
Unfortunately, the intuitions handed "dovyn through the years from rhetorician 
to composition teacher offer few insights into the relationship between writing 
quality and syntactic skill (see Hunt 1977). - 

The purpose of this exploratory study was to determine if there is a cor- 
relation between syntactic skill and scores assigned to compositions written by 
(Jbllege-level ESL students. * 

1. Method 
♦ 

Subjects, A randomly-drawn sample of 50 compositions written by college- 
level ESL "students (25 "good * and 25 "poor" writers ) / at Syracuse University 
was used. Fourteen different native languages were represented in the sample 
(see Table 7 1). 

Task. Each composition analyzed was a '30-minute timed writing on an 
assigned topic given as pare of the Michigan Test of English Language Pro- 
ficiency. Because the test vvas administered a number of times in,*i976\ several 
different topics were used (see Table 2). 

Procedure. Each composition had been graded holistic-ally on a scale of 
100 by two highly-qualified, independent raters with over twenty-five years of 
experience in grading compositions written by ESL students. This researcher 
did not grade any of ihe compositions. Iii no case did the raters disagree by 
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TABLE 1 
Native languages of subjects 

Language "Good" Writers "Poor" Writers 



(1) Arabic- 
jt) Chinese 


10 


10 


1 


0 


(3) Farsi 


1 


4 


(4; rrencn 


i 
i 


u 


(5) Hebrew - 


1 


0 


(6) Ibo 


2 


0 


(7) Italian 


I 


0 


(8) Japanese 


0 


1 


( 9 ) Lamhya ( Malawi ) " 


2 


0 


(10) Malay 


3 


0 


(11) Portuguese 


0 


1 


(12) Spanish 

(13) Thai 


2 
0 


7 

2 


(14) Urdu 


1 


0 



Task — Each composition analyzed was a 30-mmute timed writing on an 
assigned topic given as part of the Michigan Test of English Language Pro- 
hciency. Because the test was administered a number of times in 1976, sev- 
eral different topics were used (see Table 2). i 

more than 3 points which strongly suggests that they were applying similar 
standards in evaluating the compositions. Small differences in grades were 
averaged to arrive at a single, final grade based on the raters' general impres- 
sions of its quality, Good eqrnpositions were tho$e that had been assigned 
grades of 79 and above, with a men of 80.6. Poor compositions had received 
scores of 68-71, with a mean of 70.2. 

The compositions were all of different lengths and were read in the form 
that the students had submitted them. Physical appearance, then, may have 
contributed to a halo effect — the tendency to assign a score partially on the 
basis of the writer's more appropriate use of the conventional mechanics of 
writing, such as punctuation and spelling (see Braddock, Lloyd- Jones, and 
Schoer 1963). 

Non-parametric (distribution-free) procedures were tised to statistically 
analyze the data. 1 /The Wilcoxon test, one of the most powerful rank tests, 
was used to convert raw scores to z scores — their respective number of standard 
deviation units from the mean. In general, this study analyzed such things as 
length of various types of writing units (e.g. T-units 2 , clauses, and sentences), 
incidence of passive voice/ 4 types of clauses, and types of joining devices be- 

1 1 am indebted to Arthur Roth and Kenneth Kaminsky, Syracuse University statisticians, 
for their help with the statistical analysis. 

' 2 A T-unit is "one main clause plus all the subordinate clauses attached to or embedded 
witnii* it" ( Hunt 1965:49). Thus, The men are standing, on the corner t and they are laughing 
it considered two T-units, with the first T-unit containing 7 words, ending at the comma, an*! 
the second, containing 4 words, beginning with and including the conjunction and. However, 
in The men tiho are standing on the corner are laughing, there is only one T-unit consisting 
of iO words. m . m 

* Passive voice is defined here as the true or dynamic passive as opposed to the stative 
passive Examples, such as the following, were counted: They wtre given the notice and 
He had been told to sit down. The presence or absence of a fty-phrase had no bearing on 
those passive constructions chosen for use in this study. Stative passive examples, such as 
the following, were not counted: / am interested in the tesults and My- coat if torn. Such 
distinctions were made in other s» Jies to which this study is compared. 
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TABLE 2 
Topics Assigned 



"Good" Writers "Poor" Writers 

Topic m Writing on Topic Writing on Topic 



( 1 ) The U.N. should pass a law , 15 14 
limiting a woman to a maximum 

of 3 children or 3 pregnancies 
(whichever comes first). Do you agree 
or disagree? Explain your answer. 

(2) Who do you consider the most 2 4 
important person who has ever lived? 

Give a short biography of the 
person and explain why you have 
chosen him/her. 

(3) ' Which event in history has had 4 « 3 

an- important effect on vour country? 
Describe this event and how it has 
changed and/or influenced the progress 
and development of your country. 

(4) All men and women should !>e I 1 
required to seevo tjieir country * 

either in the army or tn some 
other government-designated way 
for a minimum of 2 years. Do 
you agree or disagree'* 1 Present 
arguments to support your position, 

(5) W ; hat is the value of education 2 .1 
to the individual and to society 

in your country? Give examples, 
reasons, and supporting arguments 
in your composition. 

(6) Do you believe that iMfesty is 0 1 
a quality that government officials 

should have'** If so, why? 
If not, why not? 

(7) International students who are i 1 
. sponsored have itn obligation to 

return to their own countries 
after they have finished their 
training in the V.S, Do you agreV 
or disagree? Develop your v lews 
giving reasons and supporting 
arguments. 



tween writing units. * 

A two-sided rejection region of one half the test statistic was used, and 

the criterion figure for significance within this 'level of confidence was ±1,9& 

The Bonferroni inequality tesi, with a criterion figure of ± 3.025, was then 

applied to determine if good writers iri the sample differed from poor vVhters 

in terms of simultaneous factors— factors which form, a cluster and which, 

when taken together, characterize the writing of a certain group, 

* 

2. Conclusions 

0 

The Wilcoxon test repealed that 15 of the 40 factors wen* significant ,3s 
individual /actors differentiating !>etween the writing of good and poor writers, 
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The Bonferront inequality test revealed that 12 of these factors were simul- 
taneously significant. Twenty-five factors were nonsignificant, and the 12 simul- 
taneously significant factors naturally collapsed into three larger, more general 
jories: (1) T-unit length, (2) clause length, and (&) incidence of passive 
voi& Table 3 displays the statistical results for the 40 factors considered in 
study. 

TABLE 3 

Syntactic factors 




Factor 



A. Simuitaneoiisly significant factors: 

1. ' NumbeVoflong T-units (21+. words) • 

2. NuffibeAof dynamic passives 

3. Number oi sentences with at least 1 passive 

4. Number ol words in non-fragment sentences 

5. 1'assives pet clause 

6. Words per clause 

7. Passives per T-unit 

8. Passives per non-fragment sentences 

9. Words per T-unit 

10. Number of passives in mam clauses 

11. Rati' * long T-units to all T-umts 

12. Nun. of passives in secondary clauses 

B. Individually significant factors: * 

13. Ratio of short' T-units to all T-nmts 

14. Number of words pe r non-fragment seuteiurs 

15. Number of short T-umU (1-8 words) 

C. Non-significant factors, 

16. Number of secondary* clause* 

17. Number of words in fragments 

18. Number of '.buses 

19. Number' of adverbial clauses 

20. Number of T-units joined byvauf/, hut, or 

21. Number of non-fragment sentences 

22. Number of noun clauses 

23. Ratio of #20 to all T-units 

24. Number of actives in secondary clauses 

25. Ratio of #26 to all sentences 

u 26. Number of sentence begun with and, hut, ot 

27. Number of T-umts (= main clauses) 

28. Clauses per T-unit 

29. Ratio of secondary to all clause* 

30. /Number of actives in main clauses 

31. Ratio of adjectival to all seeondnrs clauses 
32/ Niunlier of fragment sentences 

# 33. Ratio of nominal to all secondary clauses 

*34. Ratio of nikHengtb to all T-umts 

35, Number of adjectival clauses 

36, Number oi inid-length T-units (9-20 words) 

37, Number of T-units joined liy ? { , ) 

* 38. Ratio of adverlmd to all secondarv clauses 

39. Ratio of #37 to all T-umts 

40. Number of active , 



"Good" 


"Poor" 


z .score for 


Writers 


. Writers 


"Poor" 


Mean 


Mean 


Writers 


5.00 


2 24 


-4.71 


3.40 


.50 


-4.69 


3.10 


.50 


-4.65 


294.20 


. 203.40 


-4.49 


.12 


.03 


-4.47 


10.83 


8.54 


-4.45 


.fco 




-4.39 


.24 


.on 


-138 


18 40 


14.30 


-.3.62 


2 10 


.24 


-3.50 


.Hi 


.10 


-3 41 


1 2*1 


.28 


-312 


U 


26 


2.77 


2 3 00 


1*8.30 


-2.50 


i 90 


5.04 


2.16 


i r oo 


0.10 


— 1 18 


1 .30 


4 20 


1.39 


27.30 . 


2 1.80 


— 1.23 


1 14 


3 64 


— 1.21 


2.04 


2 76 


1.14 


1 3*40 


11 in 


i i *i 


A 50 


2»7() . 


-101 


12 


.16 


0 87 


9 7fi 


8 70 


-0.82 


03 


.01 


-0 78 


10 


20 


-0 72 


1030 


15.70 


-0.72 


1 71 


1 06 


-0.67 


10 


37 


-0 02 


1 1 10 


15 30 


0 62 


28 


33 


0.59 


01 


20 


0 18 


20 


27 


-0 IS 


53 


.55 


0 47 


3 04 


2 68 


-0 29 


8.81 


8 if 


0 16 


.72 


.60 


0.14 


12 


40 


-0 12 


01 


04 


0 09 


2 3 00 


2} 00 


00 3 



Clearly, there is a correlation between incidence of certain syntactic factors 
and scores assigned to compositions written by col lejp'- level ESL .students, thus 
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supporting, the alternative hypothesis and rejecting the null hypothesis. The 
most interesting findings involve T-unit length, clause length, sentence length, 
-and incidence of passive voice. 

T-unit length, especially incidence of long T-units (21+ words), i? ap- 
parently a powerful index for differentiating between the writing of good and 
poor college-level ESL writers. Good yriters wrote approximately 29* more 
.words in each T-unit, writing an average of 18.40 words/T-unit as opposed to 
• 14.30 for poor writers. 

This finding correlates with the findings of three other researchers. Hunt 
(1965), in his analysis of the writing of 4th-, 8th-, 12th-graders and professional 
adults, found T-unit length to be the most reliable index of syntactic* maturity 
through the grades. Potter (1967), who analyzed the writing of good and poor 
lOth-grade writers, found that good writers wrote longer T-units than poor 
writers. In addition, Schmeling (1969), in his analysis of college freshman 
, writing, found that there was a significant difference in T-unit length between 
first and improved versions of quality-differentiated compositions. 4 

Mean clause length, found by Hunt (1965) to Ue the second m<fct power- 
ful indicator of syntactic maturity, correlated significantly with rated quality 
of writing among the college-level ESL writers in this study, but not among 
the college freshman writers in Schmeling s (1969) study. Of equal importance, 
however, the results of this study indicate that there were no significant dif- 
ferences between goou and poor college-level ESL writers either in* terms of 
number of clauses attached to or embedded within a T-unit (clauses/T-unit) 
or in terms ot mean incidence of various types of clauses — nominal, adjectival, 
or adverbial. These findings have widespread implications, pointing us in the 
direction of the types of exercises from which "poor" writers may benefit, and 
away from those from which "poor" writers will most likely not benefit. 

First, the commonly-held intuition that "good" writers have a superior 
command of the use of subordinate clauses, allowing them to embed more 
clauses of various types within ? main clause matrix than do "poor" wipers, 
is in jio way supported by this study, thus inviting us to question the practice 
of emphasizing, in the ESL composition classroom, the mastery 0 f the tech- 
niques of subordination. Obviously, "good" writers earned their higher quality 
ratings and "poor" writers their lower quality ratings for reasons other than 
their ability to write subordinate clauses. 

Second, since it was clearly an increase in the number of words in each 
clause, rather than an increase in the number of clauses, that accounted for the 
fact that "good" writers wrote significantly longer T-units than did "poor" 
writers, 4 poor" writers may benefit from exercises in sentence combining. And 
with sentence combining exercises— exercises which present the students with 
numerous short sentences directing them to combine them into longer, more 

4 These were compositions that were first corrected by a rater, returned to the student, 
ami then resubmitted in revised form Sehmelmg then analyzed s\ntactic differences between 
the first writing and second writing of onl> those compositions that had lieen rated improved. 
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economical units (see Kaniecn 1978)— we can give students practice in writing 
longer clauses and T-units, particularly by showing them how to reduce full 
clauses to prepositional, infinitival, and participial' phrases. By our showing 
"poor" writers the various techniques for reducing and consolidating larger and 
.larger chunks of information into fewer (but longer) clauses, we will be lead- 
ing these students in the direction of reduced redundancy and increased suc- 
cinctness (see Hunt 1965), thus helping 'poor writers" write more like their 
"good" writing counterparts. 

SetUetice Length A surprising result, and one which challenges on<? of 
the most commonly-held intuitions about student writing, is that sentence length 
may not be as powerful an index for differentiating between "gcxKi" and "poor" 
writers as has been previously thought. Of the 15 factors found to be indi- 
vidually significant, ftie sentence length index placed just above the cut-off 
point for lion- significance. This finding is not completely surprising, however, 
in light of the fact that three other investigators. Hunt. Potter, and Schmeling, 
found sentence length to be a very unreliable indicator of either syntactic 
maturity or rated quality among native Fnglish speaking students. Thus, with 
the decidedly more powerful indexes of T-imit length and clause length at 
our disposal, it would be wise to carefully re-examine am intuitive trust we 
may have in the discriminating power of the sentence length index. 

Passite Voice. Perhaps the most iuteiesting lesult of this study involves 
incidence of passive voice Since it has long been thought that the use of pas- 
sive voice leads to ineffective writing, it is indeed surmising that there is such 
a high positive correlation between incidence of passive voice and scores as- 
signed to compositions written by college-level KSL students. Specifically, 
"good"' writers wrote approximately 64 times as many passives as did "poor" 
writers, the former using a passive in 252 of their sentences and the latter in 
only 62 of their sentences, In addition, "good" vVriters wrote 4 times as many 
passives "per clause, 4 times as m:uiy per T-iinit. 0 times as many per main 
clause, and approximately 4H times as many per secondary clause*. The differ- 
ences, regardless of writing unit, arc 1 certainly ch ar cut Ai.J while all 7 factors 
dealing with passive voice proved to be significant, all 3 factors involving in- 
cidence of active voice were non- significant. 

It is surprising that prejudices against the passive continue to flourish, for 
data from other studies correlate with the findings of this stud). Hunt (1965) 
found that mean incidence of passive voice inci eased through the grades, cor- 
relating at the .01 level of confidence 1 with age. Mighth- graders wrote approxi- 
mately 3 times as many passives as did 4th-graders. and 12 gradeis wrote ap- 
proximately 4 times as man) passives as did 4th -giaders. Potter (1967) found 
that "good" lOth-grade writers wrote twice as many passives as did "poor" lOtli- 
grade writers. Similarly, Schmeling ( 1969) found that among qualitv -differ- 
entiated papers, improved" versions of papers contained approximately twice 
as many passives as did first 4 versions. Finally. Wolk (1969). in bis analysis of 
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the writing of 10 professional authors, found that nearly one verb in six was 
in the passive and that nearly one sentence in three contained at least one 
passive. / 

In short, many different types of writers 'of different "ages and language 
backgrounds use the passive voice quite frequently, with 'good' 1 and older 
writers using more passives than do "poor" and younger writers. Mean incidence 
of passive voice, then, seems to be a reliable indicator of both syntactic ma- 
turity and rated quality of writing, all this suggesting that we should thoroughly 
re-examine our present attitude toward teaching the passive, for there is simply 
no apparent justification for our so completely discouraging its use. 

Discussion. It is clear that the results of this study have a number of spe- 
cific and general implications for the ESL "composition curriculum. 5 First, in 
terms of length of writing units, T-unit length and clause length appear to be 
much more reliable indexes of rated quality than* is the time-honored index of 
sentence length Statistical counts of these syntactic factors, then, may be used 
as one part of the overall procedure for properly placing ESL composition stu- 
dents, using these indexes for the .same- purposes as other objective measures 
are used in education: "prediction, . . . exemption, growth measurement, (and) 
program evaluation" (Cooper 1977:16). 

Second, the much-maligned passive voiee, long considered one of the pri- 
mary ingredients in the recipe for dull, ineffective writing, apparently does not 
greatly detract from writing quality, at least in the dialect of formal, expository 
prose — the type of prose which the assigned topics -were designed to elicit, and 
exactly the type of prose that we are instructing oUr students to write in When 
they come to our composition elas.se*. The final answer to the question of tvhy 
quality of writing correlar-d so highly with incidence of passive cannot be 
definitively answered hen hut perhaps it is that higher 'incider.ee of passive 
voiee indicates a greater control over the syntactic structures of the language, 
in some general sense leaving the reader with the impression that the writer 
has a higher level of proficiency in the written language. Whatever the answer 
to this question may Ije. it is clear that we need to teach our students that, in 
the dialect of formal expository prose, the passive is not simply an unattractive 
transform of the active, but rather a valuable tool in the arsenal of "good" 
writers. 

Third, by our knowing more about what "good" writers put into their 
writing, and what "poor* 4 writers do not, we will be better able to design ex- 
ercises to guide "poor" writers in the appropriate direction. Sentence combining 
exercises would appear to be particularly useful in accomplishing this goal, 
while exercises that teach the techniques of clause subordination do not appear 
to be useful. 

Finally, it is a solid body of empirical data, not vague intuitions, that will 

• % The limited luitme of this stud> does not iinph th.it switax is the whole of writing, for 
such things as content, development, oigani/ntioti. naturalness of expression and audience 
rtf ret i\ mess ail plu> a role in determining the quality of a eomp<,sitiou, 

o 
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more clearly reveal the nature of the relationship between ^syntactic skill and 
ESL writing quality. When we have accumulated this body of experimeataL__^ 
data, our job of teaching writing to ESL students/ and the students' job of 
learning to write, may become a great deal less difficult, and perhaps more 
enjoyable. 
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